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driven  and  a  Wiener  driven  pare  reduced  co  a  Wiener  driven  process  In 
such  case.  When  X  Is  small,  as  In  Che  case  of  che  man -maneuvered 
spacecraft,  Che  Wiener  driven  model  Is  no  longer  a  good  approximation 
co  che  Poisson  driven  pare  of  che  system.  The  system  can  only  he 
appropriately  modeled  as  a  process  driven  both  by  Wiener  and  Poisson 
processes . 

In  Chls  work,  we  consider  che  state  estimation  problem  of  such  systems 
observed  In  additive  whice  Gaussian  noise.  Compared  Co  the  problem 
for  system  driven  only  by  Wiener  processes,  this  has  received  little 
attention  In  the  literatures,  although  such  systems  arise  relatively  often 
and  naturally  In  practice. 

Among  Che  common  examples,  stochastic  control  systems  that  are  driven 
in  Che  environments  of  optimal  control  strategies  typically  exhibit 
discontinuities  and  "switchings"  In  their  trajectories,  e.g.,  the  bang- 
bang  controls  under  some  time-  and  energy- optima  11 tv  conditions 
[14-16,30],  Estimation  of  such  stochastic  systems  often  Is  necessary 
In  pursuit-evasion  games  and  In  tracking  of  maneuvering  targets  [31,32]. 

To  further  Illustrate  che  wide  application  of  such  a  class  of 
systems  and  the  motivation  for  Its  estimation  problem,  we  give  the 
following  Illustrative  example.  In  off-shore  oil  exploration,  trains 
of  sonic  waves  are  Injected  Into  the  earth's  Interior  underneath  the 
seabed;  reflections  of  such  waves  are  measured  to  estimate  the  physical 
density  of  the  earth's  interior  as  a  function  of  the  depth  below  the 
seafloor.  Should  there  be  no  oil-layers  within  the  depth  of  penetration 
of  che  wave,  Che  density  Is  a  stochastically  continuous  function. 


However,  In  case  chere  Is  oil  underneath ,  lc  is  usually  in  layers  of 
various  thickness  for  example.  Within  the  uncertainty  of  such  aspects 
as  thickness  and  quality  etc.,  each  of  these  layers  which  occur  at 
various  random  spatial  points,  contributes  a  similar  variational  pattern 
(Incident  response)  to  the  density  function.  Hence,  the  density  function 
can  be  modeled  as  a  system  driven  by  a  Poisson  process  with  very  low 
spatial  Intensity.  The  uncertainties  of  the  layers  can  be  modeled  as  a 
random  parameter  associated  with  each  incident,  called  the  mark  of  that 
Incident. 

Another  example  can  be  seen  from  the  following  environmental  problem. 
A  river  Is  occasionally  polluted  by  deposits  of  chemical  products  from 
various  sources  along  the  river.  The  amounts  of  a  particular  chemical 
that  are  deposited  each  time  are  Independent  stochastic  variables  with 
given  distributions.  The  chemical  Is  dispersed  In  the  river  by  diffusion 
and  transport  and  flows.  The  effect  of  such  chemical  to  the  various 
biological  and  environmental  systems  In  the  river  can  be  studied  by 
measuring  continuously  the  concentration  of  the  chemical  and  Its  arising 
effects  at  various  polncs  along  the  river.  The  problem  hence  relies 
heavily  on  the  reconstruction  or  estimation  of  the  times  and  amounts  of 
chemical  and  such  relevant  factors  from  the  measured  dsta.  Such  a 
problem  fits  Into  our  estimation  problem  naturally  due  to  the  Poisson 
nature  of  the  modeled  pollution  deposits. 


4 


As  seen  from  che  above  examples,  Che  class  of  Wiener  Poisson  driven 
systems  encompasses  stochastic  processes  with  discontinuous  sample  paths, 
unlike  the  Wiener  driven  systems  chat  have  continuous  sample  paths 
almost  surely.  In  the  area  of  fractures  and  defects  studies,  and  earth¬ 
quake  analysis  where  che  estimation  problem  arises  naturally,  due  to 
their  Inherent  discontinuities  In  the  sample  paths,  the  model  becomes 
conveniently  applicable. 

Such  systems  for  the  case  of  finite  dimensional  state  model  have 
Markov  property  but  in  general  are  not  Poisson,  nor  Caussian  except  In 
some  particular  cases  [2,3,4].  Due  to  the  non-Gaussian  property  and  the 
discontinuity  of  the  sample  paths,  it  takes  an  Infinite  number  of  Its 
moments  (cuoulants)  to  characterise  Its  stochastic  properties  completely. 
This  Is  manifested  later  in  the  development  of  the  optimal  filter  for 
such  processes. 

1.2.  Problem  Statement 

We  consider  Che  state  estimation  problem  of  the  following  system 
driven  simultaneously  by  a  Wiener  procesa  and  a  Pclason  process, 

dx^  ■  ST^x^dt  ♦  3tdVt  +  b^d*^  t  a  0  (1.1) 

which  Is  observed  in  white  Gaussian  noise,  vis. 

dy£  •  ht*edc  ♦  <*vt  (1.2) 

where  X£  and  y(  are  the  system  state  and  observations  respectively  with 
x^  assumed  to  be  independent  of  [x^,  t  >  0) .  wt*  Vt  *r*  independent 
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generalized  Wiener  processes  with  E(dWt  dW^ ' )  -  (^dt  and  E(dv  dvp  -  Rfdt, 
Qt.  being  positive  semideflnite  and  positive  definite  respectively. 

~c  1*  *  Poisson  process  independent  of  W^  and  with  known  rate  parameter 
1.  Without  loss  of  generality,  we  assuac  6 t ,  3^,  bt  and  h^  to  be  all 
bounded  known  time-cont inuous  functions  for  t  >  0  with 

i  c,  >  0  t  >  o  a. 

for  some  constants  The  assumption  Is  made  to  ensure  the  existence  and 

uniqueness  of  a  solution  to  (1.1)  and  (1.3).  The  problem  under  consid¬ 
eration  is  to  construct  far  each  t  >  0,  an  estimate  of  the  system  state 
*c  on  the  basis  of  observations  £ y ^ .  s<u}  where  u  >  t.  When  u  •  C,  it 
1*  »  filtering  problem;  u  ^  t,  a  smoothing  problem.  The  performance 
criterion  to  be  used  is  the  minimum  mean-squared-error  (MMSE) .  As  k  is 
large,  the  estimation  problem  reduces  to  that  of  estimating  a  Wiener  driven 
system  in  the  presence  of  additive  white  Causslan  noise.  In  this  work,  we 
are  mainly  concerned  with  the  case  where  \  is  small ,  for  which  the  linear 
estimate  may  not  be  as  suitable.  In  order  to  avoid  cumbersome  algebraic 
notations  we  restrict  ourselves  to  the  scalar  case.  The  extension  of  all 
the  results  in  this  work  to  the  vector  case  is  conceptually  straight¬ 
forward  and  can  readily  be  obtained. 

1.3  Suninary  of  Past  Results 

The  general  problem  of  state  estimation  of  linear  systems  driven  by 
Wiener  processes  observed  in  additive  white  Causslan  noise  has  been  treated 
extensively  in  the  past  (33-36]  et  al.  Complete  solutions  to  various 
modified  filtering  problems  are  available  abundantly  in  the  literature  (33- 
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41],  However,  Che  general  problem  of  state  estimation  of  linear  systems 
driven  by  Wiener  and  Poisson  processes  in  the  presence  of  white  Gauss lan 
noise  is  relatively  untreated. 

Linear  systems  excited  by  Poisson  processes  was  studied  recently  in 
the  estimation  problems  [2,3,23).  The  problem  was  solved  by  applying 
results  developed  most  recently  in  martingale  theory  (4-8,20,42-47).  The 
>tiSE  optimal  filters  in  a  form  of  an  infinite  set  of  stochastic  differential 
equations  were  derived.  In  the  presentations  of  the  results,  the  infinite 
dimensionality  of  the  optimal  filters  were  heurlstlcally  argued  to  be 
irreducible  to  finite  form.  Truncation  approximations  and  the  Ritz- 
Galerkln  approximation  methods  (2,3,48)  were  suggested  to  solve  the  set  of 
equations.  Both  in  a  numerical  simulation  study  were  proved  to  be  dis¬ 
appointing  in  terms  of  improvement  on  performance  over  the  Kalman  filter. 
Indeed,  the  author  has  shown  that  the  Kalman  filter  which  utilizes  only 
Che  first  and  the  second  order  statistics,  MKSE  optimal  for  Markov-Gausaian 
systems,  are  extremely  poor  in  performance  for  systems  excited  by  Poisson 
sharked  processes  (49,50). 

In  different  formulations,  random  systems  that  evolve  with  stochastic 
jump  processes  has  been  extensively  attended  (51-57).  The  basic  theoretic 
properties  of  such  stochastic  systems  were  studied  in  great  depth  [7,8,12,27] 
51)  at  al.  Most  of  the  works  in  area  of  estimation  however  centered  mainly 
on  Che  detection  of  jumps,  sudden  changes  of  stochastic  properties  of  the 
systems  and  on  the  renewal  processes  associated  with  the  point  processes 
imbedded  in  the  systems  (58-87).  Under  different  criteria  on  performance, 
the  approach  of  combined  estimation  detection  schemes  [68-72]  were  suggested 
for  estimating  and  detecting  the  jump  processes.  A  good  collection  of  such 
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combined  schemes  based  on  various  criteria  and  structures  of  the  problems 
are  available  (68-72). 

Related  estimation  problems  on  stochastic  systems  driven  by  gener- 
allied  Poisson  processes  have  received  extensive  attentions  [73*76]  et  al. 

In  most  cases,  the  estimation  efforts  were  concentrated  on  the  estimation 
of  the  average  Poisson  intensities  of  the  driving  point  processes  based  on 
the  noisy  observations  of  the  driven  systems. 

1.4  Chapters  Review 

We  first  derive  in  Chapter  2  the  (MMSE)  optimal  filter  for  the  problem. 
The  approach  will  be  closely  parallel  to  chat  in  [2,3],  via  the  Do 1 e an - Dad e 
and  Meyer  decomposition  rule  for  discontinuous  semi-martingales  and  its 
corresponding  filtering  theorem  [4,5,6].  The  different  aspects  of  the 
filter  is  discussed  and  investigated.  Its  performance  is  compared  to  that 
of  Che  linear  optimal  filter. 

In  0\apter  3,  we  investigate  the  problem  for  the  system  driven  only 
by  Che  Poisson  processes  i.e.  ,  3^  ■  0.  Since  the  estimation  problem  for 
the  system  driven  only  by  Wiener  processes  has  been  widely  studied  and 
developed,  this  renders  a  better  understanding  and  insights  into  the  general 
problem.  A  suboptlmal  sequential  scheme  (SSS)  based  on  a  combined  estimation- 
detection  approach  is  developed  for  this  particular  problem.  A  compensation 
strategy  is  adopted  in  the  SSS  to  eliminate  the  propagating  error  due  to 
the  sequential  structure  of  Che  SSS.  The  asymptotic  performance  for  the 
time -invar lent  case  is  derived  and  numerically  simulated  on  a  digital 
computer. 
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In  Chapter  4,  the  general  estimation  problem  is  considered.  The  SSS 
is  modified  and  extended  as  a  subopt iaal  solution  to  the  problem.  The 
different  aspects  of  the  SSS  and  its  extension  for  other  systems  will  be 
presented. 

In  Chapter  5,  we  extend  the  estimation  problem  of  the  Poisson-Wiener 
driven  linear  systems  to  a  wider  class  of  stochastic  systems,  namely  the 
conditional  Gaussian  systems.  We  derive  the  basic  equations  and  functionals 
that  are  essential  for  the  development  of  the  SSS  as  a  subopt iaal  approach 
to  the  estimation  problem.  While  the  derivations  of  the  SSS  will  be 
siB<’.  »r  as  those  in  Chapter  4,  we  will  not  get  Into  the  detail  derivations 
of  the  SSS  here. 

Chapter  6  concludes  this  thesis  with  a  summary  of  the  results  and 
conclusions  of  the  previous  chapters. 
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CHAPTER  2 


THE  OPTIMAL  FILTER 


2.1.  The  Optimal  Filter 

For  algebraic  simplicity,  we  consider  Che  scalar  case  of  the  system 
slightly  modified  from  (l.l),  given  as  follows. 


d»t  ■  ^x^dt  ♦  S^dW^  ♦  bt^t 


^  -  T\ 

t  t 


t  >  0 


(2.1) 


(2.2) 


The  principal  effect  is  a  deterministic  change  in  the  mean  function  of 

the  system  x^,  t  >  0  which  does  not  affect  Che  essence  of  our  problem. 

The  approach  we  use  Co  obtain  the  optimum  filter  is  via  Che  semi* 

Jux* 

Invarianc  generating  function  for  x,  Zn£(e  ).  First  apply  the  differenti¬ 
ation  rule  of  Dolean-Dade  and  Meyer  (see  Appendix  A)  for  discontinuous  semi* 
Juxr 

martingales  to  e  ,  putting  it  in  the  standard  semi-martingale  form. 

Then  we  apply  the  optimal  filtering  theorem  for  white  Gaussian  observation 
noise  [2-6]  (see  Appendix  B).  The  differentiation  rule  results  in 


<  „e.  „c  >t 

ju(x  ♦  b  )  Jux  Jux 

.  j  ®  a--  — 


♦  d  £  (e 
0  <  s  <  t 


jue  b#)  (2.3) 


where  <xC,  *C  >t  is  Che  increasing  process  associated  with  Che  continuous 
part  of  the  semi-mart  ingale  x  t  >  0  [2,7.8].  The  emanation  is  carried  out 
over  those  values  of  s  where  x  jumps.  Here  J  “  v*l  and  u  is  real.  Considering 


the  continuous  part  of  x,  we  have 


<xc,xc>t  -^Qtdt 


(2.6) 


and  also  discontinuous  part,  we  have 
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d  Z  1<X.  )  -  4  f  9(X ,)d\  -  l(X,)dT\ 

.  .  .  S*  w  .  S  S  t  t 

0  <  s  <  C  0 


(2.5) 


Substituting  of  (2.4),  l2.5)  Into  (2.3),  one  obtain* 

Jux  Jux  Jux 

<le  -  liutf^e  ♦  2^u)^tV  ldt 

jux  jux  Jub  Jux  jub 

♦  jue  C5tdWt  ♦  e  (e  -l)d*>t  ♦  \e  (e  -l)dt  (2.6) 


which  Is  In  a  form  such  that  the  optimal  filtering  theorem  is  readily 

applied.  Let  ?c  denote  the  conditional  expectation  of  the  random  variable 

with  respect  to  the  growing  3-field  generated  by  {  y^.O  5*1  0«  Then 

applying  the  filtering  theorem,  we  have 

/N  /\ 

'jux  ^  jux  ,  ,  'lux  lux  jubt 

de  »  [ju^x^e  ♦  7(Ju)  )dt  ♦  (•  *l)dt 


‘/'ju*t  -  /^*t  -l 

♦  ht(xt*  -  x£e  Jvc  [dy t  -  h^J  dt 


(2.7) 


If  we  substitute  the  cumulant  generating  function,  viz. 

</^ju*T  •  it  \k 

e  -  exp  (  Z  C  ] 

k-l  t  ** 

where  C.^  Is  the  k-th  conditional  cuoulant  of  x£  Into  (2.7) 
we  obtain  the  following  set  of  equations, 

"u  •  w  •  viV4’,  •  \ci««> 


dC,_  -  C,  dt  ♦  .dt  ♦  Xb? 


*2t  t  2t 


v;  •  htcit',t) 


«v  “'.v  *  -  i  hi*;1  £<;>  vuVn 


(2.8) 


(2.9a) 


(2.9b) 


^  ^  *  J|^»5s  see  (2*9C) 
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The  conditional  expectation  of  and  its  variance  denoted  by  are 
therefore  given  by 


x 

t 


-  C 


It 


P 


2t 


'2t 


(2.10) 

(2.11) 


Note  that  k  •  2,3,4,...  arc  all  Invariant  to  any  change  in  the  mean 
function  of  the  system  x^. 

From  Eqs.  (2.9a)  and  (2.9b),  we  see  that  the  optimal  estimate  has 
precisely  the  form  of  the  Kalman  filter,  except  that  is  observation- 
dependent.  The  differential  equation  for  involves,  besides  lower- 
order  cumulants,  also  £  implying  the  optimal  filter  requires  solving 

an  infinite  set  of  simultaneous  stochastic  differential  equations. 

There  are  many  processes  tn  which  the  optimal  estimates  exist 
in  closed  form.  Among  the  common  is  the  Gaussian  case,  where  the 
additional  relation 

»  /rf'  -3 

xt  "  3xtxt  “  2xt  (2.12) 

enables  us  to  reduce  the  set  of  Eqs.  (2.9  )  into  a  closed  system  of 
equations,  the  Kalman  filter  (7). 

2.2  ThfL  Nonclosedness  of  the  Optimal  Filter 

In  order  to  study  the  property  of  closedness  of  the  filter  (2.8),  we 
define  the  process  which  drives  the  optimal  filter  as  follows, 

«K  ■  dyt  *  ht*tdt  ■  «lyt  *  htcitdt  (2.13) 

If  the  process  I  exists  in  a  well-defined  manner  then  it  is  the  innovation 
process  with  respect  to  the  observation  in  our  estimation  problem  (7-llJ. 

In  such  case  it  is  a  Wiener  process  with  respect  to  the  o-fleld  generated 


r 
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by  Lys ,  0  <  s  <  t}.  Furthermore  its  stochastic  properties  are  identical 
to  those  of  the  Wiener  process  v  tn  the  observations.  It  is  called  an 
Innovation  process  because  it  carries  and  hence  can  generate  the  same 
"informat ion"  as  the  observations. 

Note  that  solutions  to  a  closed  set  of  equations  containing  states 
driven  by  Wiener  processes  are  all  sample -continuous  with  probability  one, 
provided  they  exist  (7,8).  Hence  if  there  exists  an  additional  relation¬ 
ship  in  our  case  as  Eq.  (2.12)  in  the  Gaussian  case,  which  enables  us  to 
reduce  the  set  of  Eq.  (2.9)  to  a  closed  set,  the  optimal  estimate  and  its 
cumulants  as  solutions  to  the  reduced  set  of  equations  will  be  all  sample- 
continuous  with  probability  one.  Since  the  system  we  are  estimating  is 
discontinuous  at  various  random  times  (termed  as  Harkov  times  [7])  and  the 
optimal  estimate  being  instantaneous  estimate  of  the  system  should  exhibit 
a  highly  "discontinuous"  sample  path,  we  deduce  that  the  set  of  Eqs.  (2.9) 
is  not  closed,  containing  an  Infinite  number  of  equations. 

Then  the  question  of  existence  of  an  optimal  estimate  as  solution  to 
such  a  non-closed  set  of  equations  naturally  arises. 

Not*  that  to  carry  the  same  "information"  as  the  observation  y,  the 
J-fleld  generated  by  the  innovation  process  I  has  to  be  equal  to  that  gener¬ 
ated  by  the  process  y  for  all  t,  0  <  t  <  -.  That  is,  if  we  let  5*  and 
denote  the  cr-fields  generated  by  1  and  y  0  <  s  <  t  <  •  respectively, 
by  definition  of  the  innovation  process,  the  following  must  hold  for  all  t 

*1  •  0<t<«  (2.14) 

Since  7*  is  generated  by  the  innovation  process  which  is  a  Wiener  process, 
it  is  continuous  in  t,  viz. 


t  * 


(J 


*1 

T 

1 

I 
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I 

r 

I 

IT 

u 

0 

u 

0 

L 

r 


c 

D 

0 

P 

1, 

p 


Ha  51  .  um  y1  .  j1  0  <  t  <  •  (2.15) 

,Tt  4  sic  *  C 

However  5^,  generated  by  the  observation  process  y  which  contains  a 
discontinuous  process,  is  only  right -continuous.  With  non-zero  prob¬ 
ability,  is  not  continuous,  l.e. 

lim  5*  +  5^  for  some  t,  0  <  t  <  ■  (2.16) 

Therefore,  froa  the  above  argument  and  Eq.  (2.15),  (2.16)  we  establish 

*  5*  0  <  t  <  -  (2.17) 

which  contradicts  the  assumption  of  the  process  1  being  the  innovation 

y 

process  with  respect  to?c>viz.Eq.  (2.14).  And,  by  definition  of  the 
process  1, 

(2.18) 

through  the  construction  of  I. 

Therefore,  the  process  I  is  not  an  innovation  process  and  it  carries 
less  information  than  the  observation  y.  This  implies  the  process  I  does 
not  exist,  and  from  Eq.  (2.13),  it  follows  that  x  does  not  exist,  or  at 
best  exists  in  a  certain  stochastic  limit  sense,  and  not  in  the  strong  sense. 

To  summarize,  the  optimal  estimate  is  given  in  terms  of  a  set  of 
infinite  number  of  differential  equations.  Because  of  the  non-closedness 
of  the  set  of  equations,  the  optimal  estimate  may  not  exist  and  at  best 
it  can  be  represented  as  a  limit  in  some  stochastic  sense. 

The  investigation  of  how  such  a  limit  behaves  is  a  difficult  problem 
and  does  not  bear  any  significance  to  our  problem  at  hand.  From  a 
practical  point  of  view,  the  non-c losedness  of  the  set  of  equations  has 
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nude  Che  imp lementac ion  of  such  an  estimate  formidable.  It  should  be 
apparent  that  simple  truncation  of  the  set  results  in  a  continuous 
estimate  for  x^  ,  generally  of  very  poor  performance  because  it  is 
required  to  make  instantaneous  estimation  of  a  discontinuous  system. 

Such  truncation  may  also  lead  to  a  possibly  unstable  filter. 

2 . 3  The  Optimal  Filter  and  the  Kalman  Filter 

We  Investigate  the  performances  of  the  linear  optimal  filter,  i.e., 
the  Kalman  filter,  and  Che  optimal  filter.  In  particular,  their  asymptotic 
performances  when  \  is  small  are  compared. 

The  Kalman  filter  which  utilises  only  the  first  and  second  order 
statistics  can  be  readily  obtained  and  is  given  by 

d3fe  -  c7t^tdc  +  \v[lP(t)(dyt  -  ht5ftdt)  (2.19a) 

dp(t)  -  [2iCrt  P(t )  +  Xb‘  ♦^Qt  *  hVlP2(t)]dt  (2.19b) 

where  and  P(C)  denote  the  Kalman  estimate  of  Che  system  and  its 
variance  respectively.  Notice  that  the  variance  equation  is  realization- 
independent  and  can  be  solved  ahead  of  time.  Although  the  Kalman  estimate 
always  exists  as  solution  to  the  Eq.  (2.19a),  dy&  “  h^^^dt  is  not  an 
innovation  process  with  respect  to  the  observation  since  is  not  an 
optimal  estimate.  It  should  be  apparent  that  any  random  process  with  only 
the  same  first  and  second  moments  as  x(  will  have  the  same  Kalman  filter. 
Define  now 

S  -  l.l.m.  $  (2.20a) 

\  -•  0 

1  m  l.l.m.  (2.20b) 

*  1  0 

where  the  derivative  is  taken  in  the  quadratic  mean  sense. 
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As  X  *•  0,  it  c *n  be  seen  easily  that  the  optimal  filter  given  by 
Eq.  (2.9)  reduces  to  the  closed  form  of  the  Kalman  filter  given  by  (2.19) 
with  the  assumption  of  Gaussian  initial  condition  of  x^.  We  have 


V, 


(2.21.) 


.2  -1*2 

'  Vt  C2c 


(2.21b) 


in  which  case 


and  £.  -0  t  >  0  for  k»  3,4,3 . 

“t  “ 

Taking  derivatives  of  Eq.  (2.9)  and  using  Eq.  (2.22c),  we  have 


(2.22a) 

(2.22b) 

(2.22c) 


iSlU  ■  ^uc1'  *  V2lt2»tdIt 

iEttl  '  k'*c  '  h!''t‘f‘t>l5kXt'lt  *  ‘V"' 

*  htVCl\+l,AtdIt  k  ■  2,3,4,...  (2.23b) 

Suppose  we  write 

cu  •  'it  *  l'ut  ♦  7  «•»*> 

^  \Xt  ♦  °(X2)  (2.24b) 

Hence,  from  Eq.  (2.5),  we  have  for  the  difference  between  the  two  estimates 
5t  *  Cu)  -  A  -  CUt)  ♦  o (X)  (2.25) 

Combining  Eq.  (2.19)  and  (2.23),  we  first  have 
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*  Vt'lIV'>  •  *JUldIt 


(2.26) 


Let  us  confine  ourselves  to  Investigate  the  asymptotic  values,  for  t  ». 
Assume  there  exists  t’  <  ■,  such  that 


ht\l\(t)  ^  Kl <  0 

for  t  >  t' 

(2.27) 

hVlP(t)  <  Kj  <  0 

for  t  >  t' 

(2.28) 

which  Is  readily  satisfied  by  any  stable  systems. 
Therefore  from  (2.25)  and  (2.26),  we  have 

llm  E(?c-Clt)  -  llm  \  EOC^  -  Z^)  +  o(X)  -  o(\) 
Similarly,  we  have 


v.  2  2  V  -  2  .2 

EOCt  *  Cl£)^  -  k  E0\c  -  Cut)‘  ♦  o(X  ) 

^  2 

Applying  the  differentiation  rule  to  (X.  t  -  C^Xc)  ,  we  obtain 

"Wu^U,’2  ■  2<Vhi\Vt>>(*u-!:iU>2<‘t 

♦  ht\l|p2(t>  •  4tldt 

*2<\t 

Hence  If  the  system  (2.31)  Is  asymptotically  stable 
_ X  ~  .2  *  2-  r  *  v2  .  -2. 


tu.  E(xt-clt)  -tu.  •  iut)  *.p.) 


-h2.-1 

- Hn - 

2*vVc  Px(t)> 


4t>  *  0(K2) 


(2.29) 


(2.30) 


(2.31) 


•  llm 


(2.32) 


However,  we  also  have 


P(t)]  [r  (t)  -  l. 


Slmi  lar  ly 


Continuing  the  process,  we  deduce  that  as  t  Is  large 


where  a 


Observe  that  the  sum  given  by  Eq.  (2.35)  Is  a  convergent  series,  because 


the  ratio  of  the  (k  +  l)st  term  to  the  k-th  term  is 


uniformly  as  k  -*  •  for  t  >  t ' .  The  behavior  of  the  limit  of  the  mean  squared 
difference  (2.32)  therefore  depends  on  the  behavior  of  b  . 


First  for  small  b*  as  C-**,  as  expected 


u®  z(\  -  cltr  -  o  ♦  oa“) 


(2.37) 


In  such  case,  Che  optimal  filter  reduces  to  the  Kalman  filter  asymptotically 


as  X  -•  0. 


However  for  large  b‘,  It  Is  easily  deduced  chat 


£  '<V  li»>  ~  £  •  i<v  Vi  Vt>1(— 


.2  -1.2 
htVt  bt  2 
2°t 


(2.38) 


Therefore  for  small  \  but  large  b* 


E<VCU>' 


(xb2)2 


large  t 


(2.39) 


which  Is  the  square  of  the  power  of  the  Poisson  driven  part  of 

the  systca.  Sow,  let  us  study  the  performance  of  the  optimal  filter  in 

* 

terms  of  Its  error  distribution  over  the  Wiener  driven  part  and  the  Poisson 
driven  part.  Assume  further  that  the  system  Is  stable  and  has  a  steady 
state  after  some  time  t‘  >  0. 

From  Eq.  (2.23b),  we  have 


11®  E(C2U) 


2«?t  * 


(2.40) 


Writing  C2c  •  C2t  ♦  +  0<\) 

-  >(t)  ♦  \i2K  ♦  oa) 


(2.41) 


For  large  t,  we  have 


I 

▼ 

i 

*- 

It  4 

li 
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E(C2c> 


-  f(t)  + 


-Xb‘ 


2^-  v; 


ixt)j 


o(X) 


(2.42) 


The  first  quantity  on  che  right  side  of  the  equation  Is  the  variance  In 
filtering  the  Wiener  driven  part  i.e.(  the  continuous  part  of  the  system. 
The  second  quantity  is  that  due  to  the  filtering  of  the  Poisson  driven 
part.  Notice  that  it  depends  on  ?(t). 

Comparing  the  error  variance  in  filtering  Che  Poisson  driven  part  of 
the  system  to  its  variance,  we  have  che  ratio,  denoted  by  c. 


-u>: 


■Xb 


«  ■  11m 


t—  2  (tf 


hVlP(t)] 


■  lim 


C—  £?c-  hVl?(t) 


(2.43) 


Hence  when  there  is  no  Wiener  driven  part  in  the  system, 

*(t)  ■  0 

making  c  •  1  (2.44) 


In  fact  it  is  shown  by  the  auchor  that  for  system  driven  only  by  Poisson 
Inputs,  the  steadystate  error  variance  of  the  linear  optimal  filter  is 
at  least  as  large  as  the  variance  of  the  system  (12).  This  manifests  the 
fact  that  the  optimal  filter  and  che  Kalman  filter,  both  being  Instantaneous 
filters  fail  to  estimate  che  jumps.  This  can  also  be  seen  by  noting  that 
If  the  system  Is  a  fast  decaying  system  i.e.  |^J  Is  large,  the  filter 
falls  to  estimate  the  jumps,  giving  *  -♦  1.  And  the  quantity  In  (2.43) 


n 
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2  -l 

h  v  denote*  the  signal-to-noise  ratio  in  the  observations.  As  it 
t  t 

increases,  the  error  decreases.  From  Eq.  (2.43),  the  ratio  c  decreases 
as  $(t)  increases  for  fixed  power  of  the  Poisson  driven  part.  This 
appears  at  first  to  be  contradictary.  This  is  due  to  the  fact  that  ?(t) 
only  increase*  with  the  power  of  the  Wiener  driven  part  of  the  system. 

If  the  energy  of  the  Wiener  part  in  the  system  dominates  that  of  the 
Poisson  driven  part,  the  error  made  by  the  filter  will  be  dominated  by 
the  estimate  error  of  the  Wiener  part,  i.e.  ?(t).  This  is  a  direct 
consequence  of  the  ratio  of  the  power  of  the  Wiener  part  and  that  of  the 
Poisson  part  in  the  system  itself.  Hence  in  systems  where  the  Wiener 
part  dominates  over  the  Poisson  part,  a  reasonable  approximate  filter 
can  be  constructed  by  simply  neglecting  the  Poisson  part  in  the  system. 

In  case,  the  opposite  is  true,  the  power  of  the  Poisson  part  dominates, 
both  of  the  Kalman  filter  and  the  optimal  filter  performs  very  badly  as 
an  instantaneous  estimate  of  the  system. 

2.4  Cone lus ions 

In  this  chapter,  ve  derive  the  optimal  filter  for  the  problem.  We 
deduce  chat  the  set  of  equations  defining  the  optimal  filter  is  infinite 
(non-closed)  by  arguing  chat  there  is  no  additional  relationship  that  would 
reduce  the  set  to  a  finite  set.  If  such  a  relationship  exists,  then  we  are 
guaranteed  to  have  a  solution  in  the  strong  sense  for  because  the  finite 
set  of  equations  in  our  case  is  guaranteed  to  have  a  strong  solution,  In  such 
case,  the  process  I  defined  in  (2.13)  is  the  innovation  process,  and  hence  a 
Wiener  process  with  respect  to  the  ff-field  generated  by  the  observations.  For 
a  finite  dimensional  vector  differential  equations  as  the  reduced  filter 
equations  driven  by  a  Wiener  process,  the  solution  is  always  continuous  with 
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probability  one.  Hence  we  would  have  a  continuous  estimator  for  a  dis¬ 
continuous  system,  contradicting  the  optimality  assumption  of  the  estimator. 

We  further  argued  that  the  solution  for  Cjt  does  not  exist  in  the 
strong  sence  by  showing  that  I  in  fact  is  not  an  innovation  process  in 
our  problem.  The  optimal  filter  is  unrealizable  and  unlmplementable. 

For  the  performance  of  the  optimal  filter,  we  compare  it  with  chat  of 
the  linear  optimal  filter.  At  least  for  Che  case  of  stable  systems,  the 
improvement  in  performance  of  the  optimal  filter  over  the  linear  optimal 
filter  is  extremely  small  when  X  is  small. 

We  shall  show  chat  the  linear  optimal  filters  (causal  and  non-causal) 
which  utilize  the  first  and  second  order  statistics  only,  perform  extremely 
poorly  in  the  estimation  problem  of  the  Poisson  driven  systems.  Hence  the 
optimal  filter  being  a  causal  estimator  is  not  suitable  for  the  state 
estimation  problem  for  Poisson  driven  systems. 
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CHAPTER  3 

ESTIMATION  OF  THE  POISSON  DRIVEN  PROCESSES 

3. 1  General 

In  this  chapter  the  optimal  filter  and  suboptimal  approximation 
estimation  scheme  for  the  Poisson  driven  case  will  be  considered.  It 
can  be  easily  seen  chat  by  setting  K«0  in  the  optimal  filter  Eq.  (2.9), 
the  problem  reduces  to  the  standard  linear  quadratic  Gaussian  problem 
and  che  filter  reduces  to  the  standard  Kalman  filter  which  is  finite 
dimensional  (closed).  The  none losedness  of  the  optimal  filter  (2.9)  arises 
from  the  Poisson  driven  part  of  the  system  due  to  its  non-Gauss lan  property 
and  hence  an  infinite  number  of  moments  are  required  to  characterize 
completely  che  filter. 


The  optimal  filter  for  the  purely  Poisson  driven  process  may  be  ob¬ 
tained  from  Eq.  (2.9)  by  setting  •  0  and  is  given  as  follows  [2,3,28] 


Gc.tdt  *  W  cJta‘t 


O.l.) 


♦  h  v  C.  ,  di 
t  c  k+i,c  c 


k  •  2 ,3,4. . . 


(3.1b) 


Note  that  the  filter  as  in  Eq.  (2.9)  is  still  infinite  dimensional.  The 
Wiener  part  indeed  only  comes  into  play  in  che  variance  equation,  i.e. 
the  equation. 

The  problem  may  be  generalized  by  including  a  mark  process  generated 
by  che  Poisson  driving  input.  The  problem  in  this  case  is  a  modification 
of  (2.1)  and  the  process  is  defined  by 
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dXc  "  i7cXcdt  +  t  >  0  (3.2a) 

where  U  denote*  the  mark  of  the  Poisson  process  and  Is  assumed  to  have 
a  known  probability  density  f(U).  By  definition  of  the  mark,  the  random 
variable  U  will  be  generated  independently  according  to  f(U)  at  every 
incident  of  the  Poisson  process  T)  ,  The  mark  can  be  considered  as  a  random 
parameter  generated  by  each  Incident,  e.g.,  the  marks  represent  the  random 
magnitudes  of  the  Poisson  Impulses. 

Let  the  l-th  incident  time  and  mark  of  the  Poisson  process  be  denoted 
by  respectively.  Also  denote  the  number  of  incidents  over  the  semi- 

closed  interval  (s,t)  by  N(s,t)  or  if  s«0,  N(t)  for  simplicity. 

The  objective  is  to  obtain  s  near-optimal  estimation  scheme  for  * 
from  noisy  observations  vis. 

dyt  *  ht*cdc  ♦  dvt  (3.2b) 

3 . 2  The  Optimal  Linear  Filter 

As  indicated  in  Chapter  2,  the  Improvement  of  the  optimal  filter 
(which  is  unlmp lament ab le)  over  the  linear  optimal  filter  is  relatively 
small  in  terms  of  error  variance.  We  establish  in  the  section  that  the 
linear  optimal  filter  for  the  Poisson  driven  case  with  low  intensity  la 
extremely  poor.  Consequently  both  of  the  Kalman  filter  and  the  optimal 
filter  are  unacceptable  in  their  performances.  This  Is  basically  due 
to  the  fact  that  the  system  process  to  be  estimated  is  only  right -cont lnuous , 
making  instantaneous  estimation  or  detection  of  the  jumps  in  its  sample 
paths  formidable. 
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From  Eq.  (2.44),  we  have  seen  that  the  normalized  error  variance  of 
the  optimal  filter  tends  to  unity  as  X  -*  0.  Now  we  show  that  in  the  non* 
causal  linear  optimal  smoothing,  the  steady  state  normalized  error  variance 
for  the  time  invariant  case  also  tends  to  unity  uniformly  as  X  -*  0.  This 
result  indicates  that  an  improvement  over  the  optimal  linear  noncasual 
filter  may  be  possible  only  if  a  time-delay  is  allowed  in  a  nonlinear 
filtering  scheme.  In  order  to  present  the  derivation,  the  system  (3.2a) 
will  first  be  reformulated  and  expressed  as 


dxt  -  tfex£dc  +  '  b£(U)  M(dt ,dU) 
U 


(3.3) 


where  M(<)  denotes  the  measure  of  the  underlying  Poisson  marked  process. 
Let  $(t  ,s)  denote  the  state  transition  function  of  the  homogeneous  part 
of  the  system,  then 


x 


t 


$(t,0>x0 


t 

+  '  '  $(0.8)  b  (U)M(ds.dU) 

6  & 


Hence  from  Eq.  (3.2b),  the  observations  become 


(3.4) 


y.  - 


vt  +  ht*<c»°>*0 


ht*(t,s)b<(U)M(ds,dU) 


(3.5) 


By  definition  of  the  stochastic  counting  Integral  whose  existence  and 
uniqueness  only  requires  the  point  process  M  to  be  conditional  orderly 
and  with  probability  one  have  a  finite  number  of  points  in  a  finite  Interval 
[17,18,29]  which  are  all  satisfied  by  our  Poisson  assumption  of  *\  ,  we 
obtain  the  following 

N(C) 

x  -♦(t,0)x  ♦  £  g(t)  t  ;u  )  t>0  (3.6) 

l-l  11 

where 

g(t,r;u)  -  ♦(t,T)b£(u)  (3.7) 


Is? 


ii 


1 

1 

1 

1 

I 

1 

1 

1 
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which  is  Che  integrand  in  Che  integral  in  (3.4). 


If  we  assume  is  known  (this  assumption  may  be  removed  without  diffi¬ 


culties).  a  new  observation  process  z^  may  now  be  defined  by 
*t  “  yt*  hc*<t>°>xo 


where 


N(C) 

-  v  +  £  h(c,r  ;u  ) 

i-1  1 

N(T) 

-  v  +  E  h(t,T  ;u  ) 

i-1 


h  (t  ,r  ;u)  -  ht«&(t,T)bt(u) 


t  >  0 


0  <  t  <  T 


(3.8) 


(3.8b) 

(3.9) 


Equation  (3.8b)  follows  from  the  causality  of  g  and  h  in  the  integral,  viz 


g(t,*;u)  -  0  •  h (t ,r ;u) 


for 


t  <  t. 


(3.10) 


We  now  examine  Che  performance  of  linear  optimal  smoothing.  The  case 
of  finite  interval  and  infinite  interval  linear  smoothing  are  both  con¬ 
sidered  . 

For  the  fixed  interval  smoothing  over  an  Interval  of  length  T,  the 
steady  state  error  variance  for  the  optimal  linear  smoother,  denoted  by 
is  given  from  [19,  pp.256);  as  follows 


i  „  i  -1 

*T  ■  F  .J,™*  •  F  '  ;t<») 


dtv  ♦  j  G(C)l  dt 


where  3^,  ^  are  the  spectral  densities  of  x(t)  and  z(t)  respectively, 

Che  cross  spectral  density  of  x(t)  and  z(t).  G(t)  is  an  appropriate  time- 

function  derived  from  the  spectral  densities.  Therefore 

2 


i  •  i  •  I  *«.(•)! 

«_  >  «  •  2  (id) do)  -  —  —  ■  —  da  . 

T  -  »  2^«x  IF  •  $  (id) 

•*  ••  z 
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The  substitution  of  the  spectral  densities  with  the  appropriate  expressions 
in  terms  of  the  transforms  of  Che  Incident  response  [1],  and  using  the 


notat ion 


h‘(C,T;u)  -  E  h2(t,T;U) 


results  in 


^  1  * h..,.  -i2.  1  r  X2|h()w;u)!  <**• 

«T  -  2n  .  dw  -  —  J  fU  1 - 7“? 

1  *  —  ^  —  5  +X|H(Ju);u)r 

normalization  with  the  signal  variance  which  is  given  by 

OB 

Var(X(t))  -  X  '  h2(t-T,u)d* 


(3.11) 


finally  yields 


J"ls(j-Al1*.  -  £  J*  , /JV 

**  (^-)  + 1  H(J(«;u)|  j  **  (3.12) 

The  behavior  of  the  normalized  error  variance  as  X  -♦  0  becomes  therefore 


«  &  I  U(**>;u)l2d* 

1  irn  - 1 - >  - — - 

X.0V"V  '•*.2.  T  -  . 

h  (t;T,u)d- 


(3.13) 


where  the  Pareseval's  Theorem  has  been  used.  From  Eq .  (3.13)  we  can  deduce 
that  the  performance  of  the  linear  optimal  filter  or  smoother  is  extremely 
poor  regardless  of  the  slgnal-to-nolse  ratio  and  behavior  of  the  incident 
responses  when  \  is  small,  since  the  normalized  variance  tends  to  unity 
uniformly  as  X  -*  0.  This  result  is  not  too  surprising  since  linear  techniques 
only  utilize  first  and  second  order  statistics,  and  the  Poisson  driven  process 
which  is  highly  "discontinuous"  contains  considerable  energy  in  the  higher 


order  statistics 
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Since  causal  estimation  for  such  systems  is  inherently  of  poor  quality 
when  V  is  small,  the  problem  leads  naturally  to  the  alternative  of  allowing 
some  delay  in  the  estimation  process  i.e.,  smoothing.  In  view  of  the  poor 
performance  of  the  linear  optimal  smoothing  and  the  none losedness  of  the 
optimal  estimates,  a  suboptlmal  non-linear  smoothing  scheme  will  be  con¬ 
sidered  for  the  estimation  problem.  While  the  scheme  may  not  perform  as 
well  as  ultimately  possible,  it  is  expected  to  perform  better  than  the 
optimal  linear  infinite-interval  smoother  for  low  enough  Incident  rate  X.. 

3.3  The  Sequential  Smoothing  Scheme  (SSS> 

In  this  section,  we  derive  a  sequential  smoothing  strategy  to  the 
problem  given  by  (3.2).  The  system  to  be  estimated  is  a  Poisson  marked 
process;  hence  the  knowledge  of  the  realization  of  the  Poisson  driving 
process  “  and  its  associated  mark  process  is  sufficient  to  determine  the 
realization  of  x^. 

The  approach  to  the  estimation  problem  considered  here  is  to  first 
solve  Che  particular  problem  of  estimating  the  Incident  process  T(  and  U, 
via  a  combined  sequential  estimation  and  detection  scheme  based  on  the 
criterion  of  maximum  aposteriori  probability  (MAP).  Then  the  estimate  of 
any  funct  Ion  ^(t  ,U)  Is  reconstructed  suboptlmally  through  the  estimate 

A 

of  and  U,  namely  ^(t.^,0).  Although  this  approach  Is  by  no  means 
optimal  in  the  sense  of  MMSE ,  this  is  nevertheless  a  robust  alternative 
when  Che  optimal  scheme  Is  Intractable.  In  the  proposed  sequential  scheme, 
the  observations  are  processed  in  subintervals  whose  length,  A,  Is  chosen 
such  that  the  probability  that  each  component  of  the  driving  process  having 
two  or  more  than  two  incidents  within  the  Interval  la  negligibly  small. 
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Each  subintervai  of  observations  is  processed  Co  detect  and  estimate ,  if 
it  exists,  Che  incident  in  the  interval  as  well  as  to  update  the  estimates 
of  past  incidents. 

In  order  to  reduce  the  memory  requirement  and  computational  complexity 
of  Che  scheme,  a  finite  memory  scheme  is  selected  whose  memory  size  depends 
on  the  marginal  improvements  in  Che  performance  and  the  desired  performance 
relative  to  an  infinite  memory  scheme.  Consequently  we  choose  not  to  update 
(re-esclmace)  the  estimates  of  Che  past  incidents  occurring  earlier  than  L 
subintervals  away  from  Che  new  intervals  as  shown  in  Fig.  1. 

Note  that  chere  is  a  maximum  delay  LI  inherent  in  the  structure  of 
the  scheme.  Any  estimates  with  delay  longer  than  LI  are  Che  same  as  that 
with  delay  LA  since  they  have  been  finalized.  L  usually  is  large  and  is 
chosen  as  a  tradeoff  between  performance  and  computational  complexity. 

Also  note  chat  the  scheme  structure  is  actually  independent  of  the  dealy 
imposed  by  the  original  problem.  The  delay  reconstruction  process  is  carried 
out  independently  as  the  scheme  sequentially  updates  the  estimates  of  the 
Incidents. 

3.3.1  Preliminary  Requirements  for  the  SSS 

To  be  able  to  apply  the  SSS  described  above,  there  are  a  few  basic 
conditions  we  have  to  satisfy  due  Co  the  combined  nature  of  the  detection 
estimating  scheme. 

Let  (T.,5,P)  be  Che  underlying  probability  space  for  our  estimation 
problem.  Then  the  random  processes  defined  in  Che  problem  statement  are 
all  measurable  random  processes  in  continuous  time  t€  (0,s).  For  example, 
if  B  is  a  Borel  sec  €  3  of  the  real  line  of  R* ,  ^  satisfies 


r 
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{  (<n,t)  :  \(v)  6  BJ  €  ?  x  3(0,-) 


(3.14) 


where  5(0,®)  Is  a  a-algebra  of  Borel  sets  on  Interval  (0,®).  That  simply 
says  the  random  process  T|  besides  having  well-defined  trajectories  can 
always  be  characterized  in  probablistic  terms. 

In  the  SSS,  in  order  to  reconstruct  a  given  functional  G(t,T)t(U)) 
based  on  the  estimate  of  T^dJ),  C  (t  )  has  to  satisfy  the  following 


condition. 


G<t,Tlt<«»)  €  B}  €  7c'x5(0,t)  V  t€  (0,-)  (3.15) 

« 

.  «  1  ^ 

where  B  is  a  Borel  set  €  3  of  R  and  7t  is  a  a-algebra  generated  by 

(!)  ,  0  <  s  <  t]  .  That  is  to  say,  we  require  that  GCt.'H  )  be  independent 

of  the  future  of  T)  and  that  it  can  be  completely  and  deterministically 

determined  by  the  knowledge  of  :  0  <  s  <  t} . 

To  examine  the  conditions  for  recoverability  of  T)  based  on  the 

observation  y£ ,  let  us  consider  the  vector  case  of  the  problem.  Let  x£ , 

~t  and  y  be  n- ,p-,q-dlmenslonal  vectors  respectively,  then  Eqs.  (3.4)  and 

(3.5)  are  still  valid  representations  of  these  processes,  viz. 

t 

*t  -  *(t,0)x0+  J  J*  $(t,s)bf  (U)M(ds.dU)  (3.16) 

0  u 

.  t 

2t  "  yt  ’  ht$(l »0)x0  -  vt  -  I  J  ht*(t,s)b#(U)M(ds.dU)  (3.17) 

0  U 

where  now  h£ ,  B^dJ),  ♦(t.s)  are  of  appropriate  dimensions.  To  estimate  Tlt , 
it  is  essential  that  the  observation  z^  is  a  non-trlvlal  function  of  all  the 
p  components  of  and  their  corresponding  mark  processes.  From  Eq.  (3.17), 
it  follows  quite  immediately  that  z(  satisfies  this  requirement  if  the 
following  holds 
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(1)  the  q*p  matrix  h^<$(c ,  s  )b^  (U)  contains  no  Identltlcally  zero  columns 
almost  everywhere  In  t  >  j  >  0  and  U. 

(2)  the  system  input  nxp  matrix  bs(U)  contains  no  columns  completely 
independent  of  U. 

Note  that  condition  (2)  can  be  easily  satisfied  with  a  modification  of  the 
system  equations.  Since  our  main  concern  is  to  reconstruct  ,  it  is 
already  satisfied.  Examining  condition  (l),  we  find  that  it  is  a  con¬ 
siderably  relaxed  condition  compared  to  the  usual  requirements  of  controlla¬ 
bility  and  observability  of  the  system  in  estimating  a  Wiener  driven  system. 
3.3.2  The  Estimation  and  Detection  Equations 

To  obtain  the  equations  for  the  Joint  estimation  and  detection,  we  first 
consider  the  simple  MAP  Interval  estimation  and  detection  problem  over  an 
interval  of  length  T.  Therefore,  the  estimates  of  the  incidents  and  mar*s, 
denoted  respectively  by  £,u,  have  to  satisfy  the  following 

where  f(*)  is  the  conditional  density  of  N(T)  and  t,u,  the  vectors  con¬ 
taining  all  the  incident  times  and  marks  over  (0,TA).  is  the  a-field 

generated  by  the  observations  {z(t);0  <  t  <  t} .  Since  the  dimensions  of 
l.u  depends  on  N(T),  the  maximization  can  be  carried  out  first  by  obtaining 
the  MAP  estimates  of  ^,u  assuming  the  value  of  N (T)  is  fixed  as  follows: 

-  max  XnfN(T)T  u(N*,T(N*),u(N*)|^)  (3.20)  , 

where  f.(N*),  u(N*)  stand  for  the  MAP  estimates  of  r,u  given  N(T)  -  N*. 
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The  maximization  of  r*.u*  in  Eq.  (3.19)  gives  a  set  of  equations  for 
£(N*)  and  u(N*),  the  estimation  equations.  The  maximization  over  N*  in 
Eq.  (3.20)  gives  the  detection  equations.  By  obtaining  an  expression  for 
Inf,  which  is  given  as 

infN(T)  ^u(N*.1*.u*|5Zt)  -  inP({zt;0<t<T}il*.u*.N*> 

♦  inf  (t*1  N*)  ♦  inf  (u*/N*)  a-  inP(S*)  (3.21) 


and  recognizing  that  the  first  quantity  on  the  right-hand  side  of  the 
equation  has  an  equivalent  expression,  namely  the  log -likelihood  function 
of  **.u*  which  is  given  as 


T  N*  S* 

-?  ■  At\*  -  I  h(t,r*;up)  1  Z  h(t  ,'pu*)] 

a  o  l*1  J"l 

we  obtain  the  following  set  of  equations  by  optimizing  over 


,  T  S(T)  bh(t;*  ,G  ) 

41  dt I*(c)  -  Eh(t/  ,G  )J[  - -1  ) 

<7  l  i-l  1  1  Uj 


(3.22a) 


,  T  N(T)  dh(t;T  u  ) 

4  f  dt(z(t)  -  Z  h(t;-?  G  )J|  -  J  -i-)  .  0  (3.22b) 

a2  ‘0  i-l  1  1 

for  1  <  J  <  N(T)  and  0  <  t. ,  <  r  ...  <  -  <  T.  Equation  (3.22)  actually 

—  ~  la  •*  (T ) 

is  Che  necessary  condition  for  the  MAP  estimates  of  2..H  given  N(T)  and 
always  guarantees  a  set  of  solution.  The  uniqueness  of  the  solution  set 
does  not  play  a  role  in  the  scheme  since  we  have  to  maximize  over  N(T) 
subsequently.  The  expression  to  be  maximized  with  respect  to  N  is  obtained 
by  substituting  the  distributions  associated  with  the  Poisson  process  into 
Eq.  (3.21)  which  reduces  to 

r 


I 


33 


n 

n 


A  ^ 

J(N)  -  Nio\  ♦  r  Xnt  (u  ) 

1-1  (3.23) 

T  N  N 

♦  4  I  i  E  h(tr,,G  )){  E  h(t;T  G  )) 

a2  J0  2  j-l  J  J  i-1  1  1 


3.3.3  Sequential  Approximation  Approach 

The  resulting  HAP  scheme  defined  by  the  solution  of  (3.18)  end  the 
maximlzat Ion  of  (3.19)  will  now  be  approximated  by  a  sequential  scheme. 

In  the  scheme,  we  sequentially  process  the  observations  In  subintervals 
each  of  length  A,  which  Is  chosen  such  that  the  probability  of  having  two 
or  more  Incidents  In  each  Interval  Is  negligibly  small.  Hence,  we  choose 
A  such  that 

XA  ^  »  «  i  (3.24) 


The  observation  over  the  kLh  subinterval  [(k-l)A.kA)  Is  used  to  finalize 
the  estimates  of  the  Incidents  prior  to  (k-L)A.  We  define  the  new  infor¬ 
mation  carrying  observation  y(t,k)  with  the  removal  of  the  finalized 
estimates  by 

N((k-L)A) 

y(t ,k)  •  z (t)  -  I  h(t;r  ,G  ),  t€  [ (k-L)A ,kA )  (3.25) 

l-l 

This  new  observation  In  (3.25)  will  be  used  in  the  MAP  equations  (3.18) 
and  (3.19)  to  yield  the  approximate  sequential  estimation-detection  scheme. 

at  as 

The  equations  to  be  satisfied  by  the  unflnallzed  MAP  estimates  " ,u  In  the 
Interval  ((k-L)A,kA)  when  the  kC^  subinterval  Is  observed,  are  obtained 
from  (3.22)  and  (3.25)  and  may  be  written  as 


kA 


M(kA) 


bh(t;*  ,u  ) 

-7  j  dt[y(t,k)  -  E  h('.;f  ,G  )]{ - ~ 

ff  }k-L)A  i-M((k-L)A)+l  ) 

♦  in£<«j>  •  0 


(3.26a) 
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kA 


4  dtfy(t,k)  -  Z  h(t;r  G  )J( 
C t  (k-L)A  N((k-L)A)+1 


H(kA) 

E 


5h(t  ;♦  .  ,u .) 

J— Lj 


3u 


J 


(3.26b) 


for  N((k-L)A)+1  <  )  <  N(kA) 

(k-L)A  <  TH((k.L)6Hl  <  •••  <  Tj  <  Tj+1  <*  *  *Tjj(kA)  <  kA* 

These  equations  are  Co  be  solved  for  fixed  N(kA)  (whose  values  will  be 
discussed  In  Che  sequel)  and  Chen  che  proper  choice  of  N(kA),  che  estimace 
of  N(kA)  should  be  made.  The  esdaate  $(kA)  of  N(kA)  Is  made  by  maximizing 
Che  following  expression  which  is  obCalned  direccly  from  (3.23)  over  che 
value  of  N. 


N 

E 


1 


kA 


dc(y(C,k) 


J  (M)  -  fUnX  ♦  I  Znf  (u  )  ♦  -“• 

i-N((k-L)A)+l  u  1  a*  (k-L)A 


N  N 

\  Z  h(C;f  ,u  .))[  Z  h(t  ;T  ,u  )  J 

l«N((k-L)A)+l  J-N((k-L)A)+l  J  J 


(3.27) 


Soce  that  che  deCecdon  is  carried  ouC  sequentially  wiCh  the  assumption 
of  Eq.  (3.24),  which  limits  che  values  of  N(kA)  for  the  maximization  Co  a 
sec  of  3  values  as  follows: 

(1)  If  an  incident  was  detected  in  Che  previous  interval  [  (k-2 )A , (k- 1)A) 

N  (kA)  €  { N(  (k- 1  )A ) ,  n(  (k-l)A)  ♦  1,  n((k-l)A)  +  l}  (3.28a) 

The  third  value  is  included  in  case  ChaC  the  previously  detected  incident 
has  been  a  false  detection. 

(2)  Similarly,  if  no  incident  was  detected  in  the  previous  Interval 
[  (k-2)A,  (k-l)A) ,,  then  N(kA)  is  limited  to  the  set. 

N(kA)  €  {N((k-1)A)  N((k»l)A  +  l,  N((k-l)A)+2}  (3.28b) 


j 


where  Che  last  value  accounts  for  Che  possibility  of  a  miss  in  Che  earlier 
subinCervai.  Note  also  Chat  for  each  value  of  N(kA),  it  is  required  Co 
solve  a  sec  of  equations  (3.26)  for  Che  estimates  r  (N) ,  u(N).  However, 
these  secs  of  equadons  are  uncoupled  Co  each  ocher  and  can  be  solved  In 
parallel.  Since  all  of  Che  incidents  except  Che  new  incident  (if  ic  exists) 
have  all  been  previously  evaluated,  lceraclve  perturbation  mechods  may  be 
employed  to  solve  them. 

The  sec  of  values  for  N(kA)  can  obviously  be  increased  co  include  more 
values  to  cover  Che  possibility  of  multiple  wrong  detections  in  previous 
sub  intervals. 

3.4  Performance  of  the  SSS 

3.4.1  Asymptotic  Performance  of  the  SSS 

The  analysis  of  Che  performance  of  Che  sequential  approximation  scheme 
is  quite  Involved;  exact  analytic  results  are  rather  complex  to  obtain. 

Vfe  consider  only  the  scalar  t ime- Invariant  case  and  derive  the  asymptotic 
performance  of  the  scheme  as  the  intensity  tends  to  zero.  For  comparison 
purpose,  we  express  the  performance  in  terms  of  the  error  variance  of  the 
estimate  of  the  system  state  normalized  by  Che  variance  of  the  state 
(average  power). 

When  Che  Intensity  Is  small,  the  estimation  error  resulting  from  the 
scheme  can  be  modeled  as  Poisson  filtered  process,  with  each  error  incident 
response  as  a  result  of  the  detection  and  estimation  over  each  subinCervai. 
Since  we  are  making  estimation  and  detection  for  every  subinterval,  Che 
Intensity  of  the  driving  process  Is  i/A.  Therefore  the  estimation  error, 
denoted  by  W(t)  can  be  represented  as 
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W(t)  -  J*  J  c(t;<7.V)  fff(da.dV)  (3.29) 

V  0 

where  c  (t  ;9  ,V)  is  the  incident  response  of  the  average  error  made  according 
to  the  result  of  the  detection  indicated  by  the  mark  V.  and  is  the 
measure  of  the  underlying  error  driving  provess  with  intensity  1/A. 

Define  the  mark  V  as  folios 

V  •  d , Q  indicating  a  miss  in  the  detection 

V  •  d  indicating  a  correct  detection  of  an  incident 

(3.30) 

V  *  dg,  indicating  a  false  alarm 

V  *  d^  indicating  a  correct  detection  of  no  incident 
Hence,  the  distribution  of  the  mark  is  given  by 

p(V  -  d,Q)  -  »Pl0(A) 
p(V  -  d  )  -  a(l  -  P.fl(A)) 

(3.31) 

P (V  -  dQl)  -  (l^)p0l(A) 

P(V  "  d00)  *  (l-o)(l*P0l(A)) 

where  pQ^(A),  P-q(A^  denote  the  probability  of  a  false  alarm  and  a  miss  in 
the  detection  over  an  Interval  of  length  A. 

Note  that  with  this  model, 

«2(tT;d()0)-0  (3.32) 

c2(t  r;d  )  -  E  h2(t;r,u)  (3.33) 

U 

The  variance  of  W(C)  is  therefore 

Var(W(t))  -  A  E  J  c2(t  0;V)dt  (3.3A) 

V0 

m 

•  ^  E  f  *2(t  0; V)dt 
»  V  - 


(3.35) 
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The  substitution  of  Eq.  (3.31)  into  (3.34)  yields 


Var (V(t))  -  j  (p01  (A)  j"c2(t.0;d0l)dt 

0 


+  ^(i  *  Pl0(4>J  ;“c2(c.O;du)dt 


+  XA  J  dt(p10(A)<2(t.0;dl0)  -  p0l(A)c2(t.0;d01)j)  (3.36) 

0 


which  after  normalization  by  Var(X(t))  given  in  (3.11),  becomes  in  the 


limit  as  X  -*0, 


P  (— ) 

lim  Var  (V(t ) )  _  llm  r0l  V  _  l 


X-0  varoclt)) 


(3.37) 


where  p^f")  !•  now  t*'e  probability  of  wrong  detection  in  a  semi-inf inite 


interval.  The  probability  of  false  alarm,  Pq^(&)  1*  given  by  (see  [12]) 


-  .  Zn(l/0)  V6A  . 
P0i<A>  “  «rf<  — -  +  — J —  > 


(3.38) 


where 


E,  >  ■—  E  J*  h2  (t ,0,U)dt 

cr  U  o 


(3.39) 


In  fact  when  1  is  small,  we  have 


•  2 

,  /*%  «  (t ;0,d  )dt 

P^L-  <  V«r(VCt))  ^  ,  L  £ _ 


-  Var  (X(t))  -  a 


(3.40, 


E  f  h2(t;0,U)dt 

0  0  J 


Hence,  we  expect  the  normalized  error  variance  to  approach  p-, (")/a  as 


X -*  0.  Furthermore  it  increases  linearly  as  X  with  slope 


c  (t;0,du)dt 


E  h  (t ;0,U)dt 
U  0 


_ 


which  is  expecced  to  be  very  smell  when  the  signal-to~noise  ratio  is 
reasonably  high,  i.e.  when 


Consequently  the  normalized  variance  is  approximately  p  (A)/0  when  X  is 


In  view  of  the  fact  that  the  normalized  variance  of  the  error  resulted  from 
the  linear  opcimal  filter  tends  to  unity  regardless  of  chef ignal-to-noise 
ratio  and  behavior  of  the  incident  responses,  the  scheme  performs  extremely 
well  over  the  linear  opcimal  filter,  since  under  normal  conditions  of 
signal-to-nolse  ratio 


when  X.  is  small.  The  results  imply  that  there  exists  a  rate  X*>0  such 
that  for  \  €  (0,X*J  the  subopcimal  nonlinear  scheoe  performs  better  than 
the  optimal  linear  noncausal  schesw. 


Performance  Simulation  of  the  SSS 


The  sequential  scheme  was  slaulated  on  a  digital  computer;  in  each 
case,  it  was  done  over  a  total  length  of  time  Co  Include  20  incidents. 

The  performance  in  terms  of  normalized  error  variance  was  plotted  against 
the  intensity  ranging  from  0  to  2.  L  was  chosen  to  be  4  and  a  -  0.15. 

The  system  state  was  reconstructed  with  delay  of  2. 
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the  computed  performance,  Eq.  (3.42)  was  plotted  as  a  continuous 
graph  in  each  case  and  the  simulated  performance  for  different  values 
of  1.  were  plotted  as  points  with  appropriate  notations.  For  comparison 
purposes,  the  computed  normalized  steady  state  error  variance  of  the  linear 
optimal  filter  (LOF)  was  larger  than  0.90  in  all  the  cases  over  the  range 
1  ■  0  to  2.  Several  examples  were  considered  as  follows: 

x(0)  -  0 


t  >  0 

The  mark  U  is  assumed  to  be  Gaussian  with  variance  5  and  three  possible 

cases  for  the  mean:  (1)  S  (11)  7  (ill)  10  (see  Fig.  2). 

Example  II:  x(C)  -  UT|(t)  x(0)  -  0 

z(C)  •  x (t )  +  v (t ) 

The  Incident  response  Is  a  step  function  indicated  by  |t(>), 
h(t;T,U)  -  Uw(t-T) 

Again,  U  is  Gaussian  with  variance  5  and  mean  taking  one  of  three  values: 

(i)  5  (11)  7  (ill)  10  (see  Fig.  3). 

Example  III:  The  Incident  response  Is  a  rectangular  function, 

h (t  ,r  ;U)  -  UMt-T)  -  u(t-T-L)] 

We  Include  two  cases  L-0.2  and  0.3  with  same  Gaussian  U  as  above  (see 


Example  l:  x(t)  •  -2x(t)  ♦  M(t) 


Hence 


z(t)  -  x (t )  +  v (t ) 
-Z(t-T) 


h(t;T,U)  "  Ue 


N(t) 

y(t)  -  I  U.e 
l-l 


-2 (t-r ) 


Fig.  4) 
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Example  IV:  The  Incident  response  Is  assumed  to  be  a  ramp  function  of 
slop  U. 

h(t,T;U)  -  U(t-T) 

Again,  U  Is  Gaussian  with  variance  5  and  mean  (i)  40  (ii)  100  (111)  200 
(see  Fig.  5). 

To  Investigate  the  effect  of  Increasing  the  number  of  values  In  the 
set  ( N (VA ) }  for  maximization  l.e.  allowing  delay  detection,  Che  case  in 
Example  IV  was  reconsidered  and  compared  to  the  case  when  one  more  value 
of  N(kA)  in  the  detection  was  allowed.  The  results  are  shown  in  Fig.  6. 

The  simulation  agrees  extremely  closely  with  the  approximate  expression 
for  the  performance  when  A  is  small.  And  as  Fig.  6  indicates,  the  per¬ 
formance  of  the  scheme  improves  considerably  if  we  Increase  the  set  of 
values  for  N(.kA)  in  the  detections. 

3.5  Cone luslon 

In  this  chapter,  we  have  developed  and  examined  a  sequential  esti¬ 
mation-detection  scheme  as  an  approximate  solution  to  the  state  estimation 
problem.  Its  asymptotic  performance  has  been  derived  and  shown  by  digital 
simulations  to  be  an  extremely  good  indicator  of  the  true  performance  when 
the  intensity  is  small.  Hence,  under  the  situation  of  reasonable  signal- 
to-nolse  ratio  (SNR)  in  the  observations,  the  scheme  performs  considerably 
better  than  in  the  MMSE  sense,  the  linesr  optimal  filters  when  X  is  small. 
Indeed  the  asymptotic  performance  of  the  linear  optimal  filter  is  extremely 
disappointing,  regardless  of  the  SNR  or  how  the  incident  responses  behave. 

Notice  that  the  SSS  performs  reasonably  well  for  both  stable  and 
unstable  systems  under  normal  slgnal-to-noise  environments  as  indicated 


Figure  5.  Simulated  and  computed  performance,  Var (c  )  va.  X,  for  Example  IV. 
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in  dll  Che  examples  (see  Figs.  2-6).  In  fact  is  some  cases,  it  works 
better  for  unstable  systems  chan  for  Che  stable  case  as  example  I  and 
II  show.  As  indicated  in  example  IV  and  Fig.  6,  the  performance  of  the 
SSS  can  be  further  improved  by  an  additional  delay  in  the  reconstruction 
process . 

As  we  allow  more  computational  capacity  in  the  sequential  scheme, 
choosing  L  "*  •  and  increasing  the  set  of  values  for  N(k£)  in  the  detection, 
the  scheme  becomes  MAP  optunal.  Hence  in  many  practical  cases,  it  may 
Indeed  be  close  to  the  optimal  MMSE  scheme. 
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CHAPTER  4 

STATE  ESTIMATION  OF  THE  WIENER-POISSON  DRIVEN  LINEAR  SYSTEM 
4. I  General 

We  now  address  ourselves  Co  Che  general  estimation  problem  sCaCed  in 
Chapcer  1.  As  in  Chapcer  3,  we  allow  an  addldonal  uncercaincy  in  Che 
syscem  by  incroducing  a  mark  process  exciced  by  che  Poisson  process  as 
follows 

«t  "  at*tdt  +‘StdWc  *  bt(U)d\  (4‘la) 

where  U  is  che  mark  process  assoclaCed  wlch  Che  Poisson  process  and  is 
assumed  co  have  known  scacisclcs,  l.e.  probablllCy  denslcy  f^(u).  Again 
che  objeccive  is  Co  develop  an  esdmadon  scheme  wlch  delay  (smooching)  for 
Xt  from  obiervadom,  vi*., 

dy£  •  htX£dc  +•  dv^  (4.1b) 

In  chis  chapter,  we  develop  a  subopt laal  sequential  scheme  (SSS)  as 
che  one  in  Chapter  3  for  Poisson  driven  systems.  The  scheme  is  a  modification 
of  che  one  considered  earlier,  and  consists  of  performing  detection  and 
escimadon  of  che  Poisson  Incidents  in  a  sequence  of  small  subintervals. 

The  reconstruction  of  Che  system  state  X^  will  be  obtained  from  the 
estimates  of  the  Poisson  incident  process  followed  by  a  smoothing  filter. 

The  estimation  and  detection  of  the  Poisson  incidents  are  performed  using 
the  MAP  criterion  as  in  Chapter  3,  while  the  smoothing  filter  employs  the 
MMSE  criterion.  Basically  the  only  difference  in  the  SSS  for  the  process 
(4.1)  from  that  in  Chapter  3  is  that  in  the  reconstruction  process  we  apply 
the  MMSE  smoothing  to  the  Wiener  driven  part  of  Che  system  whereas  in 
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Chapter  3  the  reconstruction  is  purely  deterministic  once  the  estimation 
and  detection  of  the  Poisson  process  is  finalized. 

Due  to  the  sequential  structure  of  the  SSS,  the  errors  arising  from 
any  deviations  in  the  estimation  and  detection  of  T\  will  tend  to  propagate 
and  accumulate.  A  compensation  approach  will  be  included  in  the  SSS  to 
eliminate  the  error  propagation  effect.  The  numerical  algorithms  in  the 
optimization  procedures  and  the  solution  of  the  equations  in  the  SSS  are 
discussed  and  presented  in  an  algorithmic  form.  The  asymptotic  performance 
for  the  t ime- invar iant  case  for  small  intensity  is  derived  analytically 
and  simulated  numerically. 

4.2  Single  Interval  Suboptimal  Smoothing 

The  system  given  by  Eq.  (4.1)  is  by  assumption  a  measurable  process  of 
the  Wiener  input  process  and  the  Poisson  input  process  together  with  the 
initial  (random)  conditions.  If  the  Poiston  Inpuc  is  given,  and  the  initial 
conditions  are  Gaussian,  the  system  process  can  be  easily  shown  to  be 
conditionally  Gaussian  (7).  This  conditional  Gaussian  property  enables  us 
to  obtain  an  optimal  estimate  in  closed  form  if  the  Poisson  process  is 
known.  Since  the  system  is  driven  linearly  and  Independently  by  the  two 
input  processes,  that  optimal  estimate  is  in  fact  the  optimal  MUSE  estimate 
of  the  Wiener  driven  part  of  the  system,  and  in  our  case  is  Just  the  Kalman 
filter. 

Let  the  conditional  filtered  and  smoothed  estimates  of  the  system  X( 
over  the  Interval  (0,T)  be  denoted  by  m£  and  respectively,  l.e. 


0  <  t  <  T  (4.2) 


0  <  t  <  T 


(4.3) 
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D 

where  In  our  notation,  5  denotes  the  o-field  generated  by  ;  0<t<s}. 

s  c  — 

Note  that  tn  and  m  are  not  the  filtered  and  smoothed  estimates  of  the 
t  t/T 

system  but  rather,  the  pseudo-estimates  based  on  the  given  knowledge  of  the 
Poisson  input  ^ .  The  Poisson  input  can  be  specified  over  (0,T)  by  giving 
the  number  of  incidents  over  the  interval  and  the  incident  times  and  marks. 

To  indicate  the  dependence  on  N(0,T)  (N(T)  for  short)  and  (r^u)  of 
m^  and  ac/j»  use  Che  explicit  notation 

0  <  C  <  T  (4.4) 

®t/T  "  “t/T(N(T)’  *  0  <  C  <  T  (4.5) 

With  these  notations,  a  likelihood  function  of  m^  observed  in  white  Gaussian 
noise,  denoted  by  LRfn^),  can  be  constructed  using  the  concept  of  estimator- 
correlator  receiver  [10,11]  et  al. 

T  .  T  , 

LR(®C  (N(T),t,u)]  -  exp{  J  J  J  Btdt)  (4*6) 

Hence  for  the  MAP  estimate  of  N(T),  u  and  we  have 

J[N,L,u]  -  Zn  LR(m  (N.t.u)  ♦  Znf  .(M.T.u)  (4.7) 

where  f„._x  (.,-,-)  is  the  aprlori  probability  of  N(T)  and  -,u. 

S  (T  )^,u  “ 

Therefore  the  MAP  estimate  of  N(T)  and  *,u  denoted  by  N  and  £,u  satisfy 
the  following, 

J(S,^,G]  -  max  J(N*,T*,u*]  (4.8) 

r  -j 

Notice  that  the  dimension  of  ^,u  is  N.  The  maximization  can  be  done  by 
assuming  the  value  of  N  and  maximizing  with  respect  to  ' ,u  first.  Therefore, 


I 
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J[8,r,u)  -  max  max  { J (N*,t*,u*J }  (4.9a) 

N*  r*,o* 

-  max  J[N*.  ^ (N*) ,  u(N*)l  (4.9b) 

N* 

Although  N*  can  take  up  any  positive  Integer  value,  large  values  of  N* 
can  be  neglected  in  the  maximization  by  an  appropriate  choice  of  the  length 
of  the  interval  T.  The  aposteriori  probability  of  N(T)  is  uniformly 
continuous  with  respect  to  its  apriori  probability,  which  can  be  made 
arbitrarily  small  with  the  choice  of  the  time  interval.  Hence  the  maxi¬ 
mization  can  be  done  over  a  set  of  finite  elements,  N*  *  0,1.2,...K 
neglecting  values  higher  than  K. 

After  the  MAP  estimates  of  N  and  ^,u  are  determined,  the  estimates 
of  the  system  can  be  readily  determined  in  a  suboptlmal  fashion  by  sub¬ 
stituting  the  estimates  of  T,u,N  in  the  expressions  for  and  (4.4) 

and  (4.5)  (see  Fig.  7). 

From  Eq.  (4.1),  we  have 


t  t 

X  •  ♦(t,0)Xo  ♦  *  ♦(t.s)B  dW  +  ]“  <J>(t,s)b  (U)d?  (4.10) 

c  0  b 

If  £  T)  #  (U)  •  0  <  s  <  t)  is  known,  the  system  is,  in  affect,  a  Wiener  driven 
process.  The  last  term  of  the  equation  Just  gives  the  system  a  different 
mean  function.  Hence  we  can  write  q»c  at  follows, 

*c  ,C 

®t  “  Xt  -  ♦(t.*)bg(U)  <Dg 


/A  1  1  \ 


where  X. 


is  the  estimate  of  the  Wiener  driven  part,  and  is  given  by  its 


Kalman  filter 
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K  ■  tfe*t  +  Pt  Vt  l*t  '  hc  I  ♦<t*»>VU)dH§  *  \2£l  t  t  o 

0 


(4.12.) 


PC  -  1C  PC  +  + 


2  -l.c' 


t  c 


\\  ♦  Vt \ 


t  >  0 


(4.12b) 


with  XQ  -  XQ  , 


P*”  •  P 
*0  *0 


After  some  algebraic  manipulations,  we  have  equivalently, 

t 


and 


mt  "  x?  +  xtl  +  6  *<t.»)bs(u)<n> 


£  -  V?  ♦  -  htxt°i 


(4.  13) 


(4.14) 


„.  t  c  ?  1 

Xc  -  ;  ♦(t.s)bf(U)  (  j  <p(t,u)hVVdu]  dTJ 

0  s 

4  ■<*)  l 

-  Z  itCrt;ut) 
l-l 


(4.15.) 


(4.15b) 


where  tp(t,u)  is  the  state  transition  function  of  the  homogeneous  part  of 
the  filter  in  Eq.  (4.12)  and  g^(r^;u^)V  1  *etlsfy  the  causality  condition 
by  definition  of  the  Integral  (4.15),  i.e. 


*t^W  *  0 


V  t  <  T 


Vi 


(4.16) 


Similarly,  we  have 


J  ♦(t,s)b>(U)<ITli  - 
0 


N(t) 

z 

1-1 


iJul> 


(4.17) 


Note  that  in  Eq.  (4.13),  X®  is  the  only  term  on  the  right  side  of  the 
equation  that  does  not  depend  on  the  realization  of  1| ;  It  Is  the  only 
term  that  depends  on  the  observation  yf  In  fact.  The  remaining  two  terms 
depend  only  on  the  realization  of  ~  and  not  y  ,  they  are  completely  deter¬ 
mined  by  the  knowledge  of  M(t),  T,u. 


■SRP 
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The  smoothed  pseudo-esc imate  mt/T  can  be  readily  constructed  from 
and  is  given  as  follows  [7,9,12]  (see  also  Fig.  7) 


-c  NCT)  0 

■t/T  '  Xt/I  \Zml  *.<W 


where 


K/t  ■  2pt  Kn-fyt 

with  ^/T  - 


Pc  -  PC 
*T/T  T 


(4.18) 

(4.19a) 

(4.19h) 

(4.19c) 


The  substitution  of  Eqs.  (4.13-4.17)  into  Eq.  (4.7)  yields  an  equivalent 
expression  for  the  MAP  maximization  which  can  be  expressed  as 


T  N 

JT(N.L.u]  -  J  dtixj  E  g°(T1.ut)Jl-  -  yt) 

0  1-1 

T  N 

-  \  r  dt[xj  ♦  r  g°r.,u  )12  ♦  NXnX 

2  ft  C  l-l 


dt 


i  ,(«t(Ti*ui)  +  *?(Ti*ui)I 


i-i 


vy  -£?  -  y  r  l*JcTl.«l)  *  i^CTfU^lU  NXnX  (4.20) 

1  i-l 


where 


This  expression  can  be  further  simplified  into 

T  N  ,  "  1 

Jt0».t,u)  •  dt  [  E  *,^,.0)1*,.  -  J  1  8t(Tl,Ui),f  + 

T  0  [l-l  l“l  J 

gtT,U)  ^  g^(T,U)  ♦  g‘(T,U) 


A  * 

*t  -  yt  +  xt 


NinX.  (4.21) 

j 

(4.22) ' 

(4.23) 


Note  that  J  •  0  when  N  *  0.  Also  (4.21)  Is  in  the  conventional  form  of 

the  likelihood  function  where  z^  is  the  observation  of  the  signal 
N 

I  g  (T,,u  )  the  presence  of  white  Gaussian  noise, 
i-l 

To  summarize  this  section,  we  have  obtained  the  general  equations  for 

the  proposed  smoothing  scheme.  The  crucial  step  is  to  isolate  the  parts 

of  the  system  which  depend  solely  on  the  realization  of  *lt(U).  They  are 

finally  reconstructed  through  the  detection  and  estimation  of  the  incidents 

of  the  process  "^(U).  The  other  part  of  the  system  is  estimated  using  a 

MMSE  optimal  filter  which  most  importantly  has  a  closed  form.  Notice  that 

the  linearity  of  the  system  model  is  not  required  for  obtaining  the  MMSE 

closed  filter.  It  only  requires  that  the  system  conditional  on  the  cr-field 

gives  rise  to  an  optimal  filter  of  the  closed  type;  in  our  case  the 

- 

system  is  conditionally  Gaussian  with  respect  to  5  '  hence  gives  rise  to  a 
closed  optimal  filter,  the  Kalman  filter. 


4.3  Sequential  Smoothing  and  Its  Numerical  Algorithm 

In  this  section,  we  consider  a  sequential  smoothing  scheme  approxi¬ 
mately  implement  the  scheme  proposed  in  4.2.  Such  a  sequential  scheme 
will  make  Che  implementation  in  real  time  feasible,  and  offers  the  advantages 
of  sequential  iterative  programing  for  the  computations. 

In  the  scheme,  observations  in  subintervals,  each  of  duration  A  are 
processed  sequentially  to  estimate  Poisson  incidents  in  that  new  subinterval 
and  also  update  those  in  the  past  (or  part  of  the  past).  Then  the  system 
state  is  reconstructed  subject  to  the  permissible  delay  based  on  these 
estimates  and  the  estimate  of  the  non-Poisson  driven  part  of  Che  system. 
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The  approach  for  obtaining  the  estimates  of  the  Incidents  is  similar 
to  chat  in  the  previous  chapter.  Basically,  upon  receiving  a  new  sub- 
interval  of  observat ions ,  the  problem  of  a  single  Interval  smooching  as 
in  Che  last  section  has  to  be  solved.  Although  there  are  some  nice 
properties  arising  from  the  sequential  structure  of  the  scheme  Che  major 
component  of  the  scheme  lies  in  the  maximisation  of  the  expression  in 
Eq.  (4.21). 

Note  chat  the  expression  to  be  maximized  does  not  have  a  second 
derivative  with  respect  to  ~,u,  since  it  Involves  the  derivatives  of  z 
which  contains  white-noise  components.  Hence,  second  order  iterative 
numerical  techniques  for  optimization  do  not  apply. 

Another  point  to  note  in  Che  maximization  is  Chat  Che  number  of 
incidents,  i.e.  the  dimension  of  r^,u  Increases  as  the  length  of  smoothing 
interval  increases  by  &  each  time.  Hence  a  robust  approximation  approach 
which  requires  up-dating  only  part  of  the  previous  estimates  is  considered. 

In  order  to  develop  the  algorithm,  we  shall  assume  that  we  have  the 
estimates  of  the  number  of  incidents  and  their  times  and  marks  to  time 
t  *  ml ,  denoted  by  N(rul),  ^(nA),  u  (mi )  respectively.  Then  we  add  as  new 
subinterval  of  observations  of  length  A,  where  A  is  to  be  so  chosen,  as  to 
satisfy 

)lA  -  Or  «  1 

i.e.,  the  probability  of  having  more  than  one  incident  in  the  new  sub¬ 
interval  is  negligibly  small.  Hence  the  number  of  incidents  up  tu  c  *  ml 
can  Cake  on  two  values,  namely 


N(nA)  and  N(nA)  +  1 


(4.24) 
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Therefore  expression  (4.21)  only  needs  to  be  maximized  for  two  sets  of 
*((n+l)4),  which  can  be  done  independently.  Note  that  the  entrees  of 
the  vectors  T,u  have  been  ail  or  all-but-one  estimated  previously  using 
the  data  up  to  t-nA.  Although  they  have  to  be  updated  since  more  data  has 
been  considered,  their  estimates  can  serve  as  an  approximate  point  in  an 
iterative  algorithm  for  the  new  estimates.  There  are  quite  a  few  iterative 
techniques  that  are  applicable  to  the  maximization.  Among  the  available, 
we  choose  an  approach  that  does  not  require  taking  any  derivatives  and 
converges  at  a  relatively  fast  speed. 

w.3.1  Numerical  Iterative  Algorithm 

The  iterative  algorithm  consists  of  first  determining  for  any  given 

iterated  point  a  direction  along  which  the  maximization  of  J  can  be  done, 

then  the  scalar  maximization  is  done  by  some  efficient  search  procedures. 

-k 

More  precisely,  if  we  let  be  the  k-th  approximation  point  to  the  optimal 

r*  which  lies  in  some  N-dlraensional  Euclidean  space  E  ,  the  new  approxi- 
-k+1 

nation  point  will  be  given  as 


d 


-k+l 


,d€  s 


(4.25) 


where  d  is  the  chosen  direction  for  maximization  of  J  and  5  is  the  maxi¬ 
mization  constant  that  gives  the  maximum  J  subject  to  the  constraint  that 
the  ordered  Incident  times  must  satisfy 

0  -  ?  1-1  <  ?i+l  ^  i  ■  2 ,3-N  (4.26) 

Actually  constraint  (4.26)  g'.ves  the  range  for  a  in  the  optimization,  since 


-k+l  3k+l 

Vi  ‘  ?I 


i  -  2---N 


57 


! 

E 

r 

u 


u 


i 

II 


0 

D 

0 

£ 

E 

£ 

II 

: 

i 


we  have 


t  ^  +  ad  <  t  *  +  ad 

i-1  i-1  i  * 


which  results  In 


a  *  \ 


TrTi-i 

di-di-i 


i  l-i 

d.-d. 


V  Wi 


d  >  d 
l-i  i 


d  .  <  d 
i-1  1 


(4.27) 


1  •  1,2 . N  +  l. 

We  take  ?J+l-  0,  dQ  -  Vl 

we  assume  d^  >  0,  we  have 


~k+l 

0  and  r  ■  nA,  and  without  loss  of  generality 
1 


r 

d. 


max  ( 
1 

d  .<  d 
l-i  l 


T  -t 
1 


1-1 


dl-l-dl 


.)  <  a  < 


min 
1 

d  >  d 
1-1  1 


r l"’ 1- 1 
<  d  -d")  < 
dl-l  di 


nA 


k+1 


(4.28) 


which  is  the  feasible  range  of  5  for  the  maximization  search  procedure. 

Mathematically ,  if  the  iterative  maximization  algorithm  is  denoted 
by  A,  it  can  be  represented  as 

A  -  M*D  (4.29) 


N  2  N 

where  D  ■  E‘  ”*  E  is  a  polnt-to-sct  mapping  that  determines  a  direction 

2N  N 

for  maximization  for  the  objective  function  J;  and  M  •  E  -*  E  is  a  polnt- 
to-sct  mapping  that  gives  a  better  approximation  point  for  the  maximization 
given  the  direction  and  the  iterative  point. 

We  choose  the  Golden-Section  Search  for  M  and  the  Conjugate  Directions 
Technique  for  D.  The  convergence  of  algorithm  A  solely  depends  on  the 
closedness  of  M  and  D.  For  our  choice  of  M  and  D,  it  can  be  shown  easily 
both  closed  and  hence  convergent  procedures  [13,14]. 
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Due  to  Che  unimodal  property  of  J  In  the  mean  sense,  the  Golden 
Section  Search  procedure  is  a  very  efficient,  convergent  search  technique 
over  the  feasible  range  of  5.  It  does  not  require  any  derivatives  of  the 
objective  function  J,  avoiding  the  analytic  difficulty  of  taking  derivatives 
of  white  noises. 

The  conjugate  directions  usually  defined  with  respect  to  a  quadratic 
function  are  a  set  of  directions  such  that  maximizing  in  each  of  the 
directions  will  give  the  maximum  of  the  quadratic  function.  In  E*  ,  there 
are  N  such  directions  in  the  set.  The  generation  of  such  directions  for  a 
given  quadratic  form  does  not  require  taking  derivatives  and  for  maximization , 
the  order  of  the  directions  maximized  is  immaterial.  Hence  the  algorithm 
converges  within  N  steps  in  case  of  maximizing  a  quadratic  function. 

Note  that  our  objective  function  J  to  be  maximized  is  in  general  non¬ 
quadratic.  Generation  of  such  a  set  of  conjugate  directions , is  impossible. 
Nevertheless,  we  exploit  a  quadratic  approximation  to  J  without  explicitly 
calculating  the  Hessian  matrix,  which  in  our  case  is  not  well  defined  due 
to  the  white  processes  involved.  We  iteratively  generate  a  set  of  conjugate 
directions  which  gives  a  better  optimization  to  J.  With  several  iterations, 
the  requisite  Information  about  the  Hessian  is  developed.  The  overall 
procedure  can  be  thought  of  as  generating  conjugate  directions  to  a  quadratic 
that  approximates  J.  Since  the  algorithm  for  the  procedure  is  closed  [13], 
the  overall  scheme  is  convergent. 

For  the  details  of  the  complete  maximization  scheme  see  Appendix  C. 

For  our  choice  of  A,  the  length  of  the  subinterval,  at  most  there  is 
only  one  Incident  time  that  has  never  been  estimated,  the  rest  are  all  close 
to  the  optimal  value  maximizing  J.  The  numerical  procedures  should  converge 
extremely  fast  within  a  few  iterations. 
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where  N 

•  N(n&)  -  N((n-K)A) 

(4.31a) 

?l 

“  T l +  N( (n-K)4) 

(4.31b) 

“i 

"  °1  ♦N((n-K)A) 

(4.31c) 

and 

N((n-K)A) 

“t‘2  *  «t(Ti'Ui) 

(4. 3 Id) 

1-1 


Notice  that  expression  J  In  (4.31)  Is  In  Che  sane  form  as  J  In  (4.21).  The 
difference  Is  chat  the  number  of  unknowns  -^’s,  now  Is  less  chan 

•^'s,  ul  *  by  exactly  2N((n-K)&).  Indeed  the  expected  nunfcer  of  unknowns 
In  3  Is  2K(\A)  which  Is  small  since  (X.&)  Is  small. 

(II)  Shortening  the  Interval  of  Integral  In  J  Eq.  (4.21)  by  lncreaaing 
the  lower  limit  of  the  Integral,  placing  all  Che  weight  of  Che  observation 
In  the  shortened  interval  for  Che  estimation.  If  the  Interval  for  the 
Integration  In  the  expression  of  J  Is  (O.nA),  It  can  be  shortened  to 
[)i&,nA)  where  M  Is  a  parameter  to  be  determined.  Combining  approximation  I 
and  II,  we  can  write  the  expression  for  J  as 

nA  B  .  N 

3  . -  J  <**£  I  MVvK*.  -  7  r  •.(*..«,))  (4-32) 

“  (n-K4M)i  1-1  C  1  1  1  1-1  C  1  1 

Note  chat  In  (4.31),  ail  Che  T ^ ' a  are  within  the  interval  of  the  Integral, 
and  In  (4.32)  some  of  them  are  not  but  their  Incident  responses  arc. 

While  both  approximation  I  and  II  reduce  the  computational  complexity 
and  enhance  the  speed  of  the  process,  they  are  different  in  nature.  The 
application  of  one  or  both  and  subsequently  the  choice  of  K  and  M  depend 
on  the  tradeoff  of  various  aspects  as  availability  of  computational  facilities 
maaerlcal  operations,  speed  and  desired  performance  of  the  overall  scheme. 


Basically,  for  stable  systems  especially  those  of  large  negative 
eigenvalues,  Approximation  I  is  a  robust  choice  since  new  observations 
are  unlikely  to  improve  very  much  the  estimates  of  thes  incidents 
occurring  in  the  remote  past.  For  unstable  systems,  Approximation  II 
nevertheless  is  a  good  alternative  in  reducing  the  computational  burden. 

In  that  case  the  incidents  in  the  past,  are  still  being  re-estimated, 
but  based  primarily  on  their  effect  on  the  recent  observations.  Such  a 
weighting  of  latter  observations  is  consistent  with  the  unstable  nature 
of  the  Incident  responses. 

4.4  Asymptotic  Performance  of  the  SSS 

Since  the  general  performance  of  the  SSS  is  difficult  to  derive, 
we  consider  only  the  t ime- invariant  case  and  derive  the  asymptotic  per* 
formance  of  the  scheme  as  the  intensity  tends  to  zero. 

From  Eq.  (A.  19)  which  gives  the  estimates  of  the  system,  the  estimation 
error  l t  resulting  from  the  overall  scheme  can  be  expressed  as  a  sum  of 
two  errors:  one  from  the  estimates  of  the  Poisson  driven  part  and  one  from 
the  smoothing  of  the  conditional  Wiener  driven  part  of  the  system.  The 

0k 

smoother  X£  „  given  In  Eq.  (4. 12)  is  derived  conditionally  on  the  assumed 
knowledge  of  ”t(U),  i,e,  5  ;  due  to  the  linearity  of  the  smoother,  its  error 
can  be  expressed  as  a  sum  of  an  ~t»free  term  and  an  ^-dependent  term. 
Therefore  we  have 

"  *t  *  *t  *  *t  0  <  t  <  T  (4.33) 

where  and  denote  the  *lt*free  and  T| -dependent  term  of  the  smoothing 

error  of  the  Wiener  driven  part  of  the  system;  e^  denotes  the  error  of  the 

*c 

estimate  of  the  Poisson  driven  part.  Since  the  conditional  estimate  X^.  ^ 


! 


1  y 

with  respect  to  5r  x  S' ,  tt  can  be  shown  easily  from  Eq.  (4.12)  that 


,0  _c 
‘c  *  Pc/T 


(4.34) 


which  is  Che  variance  of  Che  conditional  smoother  given  in  Eq.  (4.18).  In 
face  ic  is  chae  variance  one  would  obtain  in  smoothing  the  Wiener  driven 

pare  of  the  system  using  an  MMSE  optimal  smoother.  Note  that  Che  estimate 

*  T|  c 

_  is  conditional  on  ,  but  the  variance  ^  is  independent  of 

When  Che  intensity  \  is  small,  the  estimation  error  resulting  from  the 

scheme  can  be  modeled  as  a  Poisson  filtered  process  { 1 ,50  J ,  with  each 

incident  response  as  a  result  of  the  detection  and  estimation  over  each 

sublnterval  of  length  &.  Hence  the  Poisson  filtered  process  driving  the 

error  process  of  the  scheme  has  an  intensity  of  1/&. 

Combining  Eqs.  (4.33)  and  (4.34),  we  have 

t 

V  6 


*t  *  Pt/T  + 


(«(t;c,V)  ♦  e(t ;c.V)P? (dc ,dV) 


t  <  T 


(4.35) 


Here  <(t,;;V)  and  e(C,?;V)  are  the  average  error  response  made  according 
to  the  result  of  the  detection  indicated  by  the  mark  V,  and  3?  is  the  measure 
of  Che  underlying  error  driving  process  with  intensity  1/&. 

Define  Che  mark  V  in  a  similar  fashion  as  In  Chapter  3, 

V  •  d,r  Indicating  a  miss  in  Che  detection 

V  •  d^  indicating  a  correct  detection  of  an  incident 

V  *  indicating  a  false  alarm 


V  *  dQ0  indicating  a  correct  detection  of  no  incident 


The  distribution  of  the  mark  is  given  by 
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P(v  -  dlQ)  -  apl0(^) 

P(V  -  d  .)  -  a(l-p  Q(A)) 

(4.36) 

P(v  •  dQl)  -  <l-or)p01<A) 

p(v  "  d0o)  *  <i**><i-P0i{A» 

where  pQ1(&)  and  p ,  Q  (^ )  denote  the  probability  of  a  false  alarm  and  a 
miss  in  the  detection.  Hence  the  variance  of  denoted  by  Var^)  la 
given  as  follows 

.  T  - 

Var(C  )  -  p'  +  r  !  j  (C(t;0.V)  ♦  e(t;0,V)]<it 
V  0 

T 

•  P'/T  ♦  j  E  J  (C(t;0,V)  *  e(t;O.V)J2  dt  for  t  <  T  (4.37) 
v  0 

The  substitution  of  Eq.  (4.36)  into  (4.37)  yields 

V,r(?t)  'vtn  •  a  {api0(&)  JleU;°.dl0>  ♦  «(c;.0.dl0)Jz  dt 

0 

T  5 

♦  *<1-Pl0<6))  j*  [C(t;0.du)  +  e(t;0.du)I^dt 

0 

J.  2 

♦  (l-«)p0l(4)  ;(C(t;0,dol)  ♦  e(t;0,d0l)]idt  (4.38) 

0 

Mote  that  we  have  only  three  terms  in  Eq.  (4.38)  because  In  case  of  a 
correct  detection  of  no  Incident  (V  •  d^),  no  \ -dependent 

error,  l.e. 

6(ti°*d00)  *  •(t;0,doo)  ■  0  (4.39) 

Assuming  that  Che  average  error  made  by  a  correct  detection  is  relatively 
small,  since  a  «  l. 


For  the  t Lme- invariant  case  i.e.  che  system  function  and  observations  are 
not  time  dependent,  from  the  definition  of  C  and  e,  we  deduce  easily  from 
Eqs.  (4.15),  (4.16)  and  (4.17)  that 


e(t;0,d  .)  -  E  g°(0,U)  -  ube4C 
‘  U 

<(t;0,d  )  -  EIB2Q(P<:)'1  JCda.axp{(a  ♦  B2Q(PC)*lHt-s)}  .  gl(0,U)} 

U  o 

-  E  [-  B2Q(PC)“l]  J  du  exp( [a  +  B2Q(PC)‘l] [t-u]} 

«  0 

u  .  n 

•  j*  ds  exp{  (a-P^h  v  l)  (u-s)}g°(0,U) 


(4.41a) 


(4.41b) 


(ube4t )  (  j-‘-  )  [  1 

h  v 


a2  c  -1  -Pch‘v  *t 

(B2QPC*l+  Pch2v‘l) 


}  (4.42) 


where  u  denotes  che  mean  of  U.  For  comparison  purposes,  let  ua  norma  lire 
both  sides  of  Eq.  (4.40)  with  the  variance  of  che  Poisaon  driven  part  of 
the  system  which  is  given  as  |1] 

X 

Var(x")  -  \  J  («J(0))2dc 
0 

*  2 

-  X  |  (e(t;0td0l)J  dt  (4.43) 

0 

Therefore  from  Eqs.  (4.40)  and  (4.43),  the  error  variance  in  estimating 
the  Poisson  driven  part  of  the  system  normalised  to  the  variance  of  that 


T 

l 


j- n  A  V«<V  -  Pc 
V«r(C  )  -  - - - 

C  Var(x") 


p  (b\ 


J  (C(c;0,d0  )  +  e(C;0.d  ) J^dt 


F(t;0,d0l)J‘dt 


\  tends  to  zero,  4  *♦  •  end  T  ■  therefore 


P  .(A)  j.  lC(c;°.doi>  ♦•(C;0.dol)]  dt 
U*  Ver(g")  -  11*  - 

\-»0  T,A-»»  ,  .  ,2 . 

[C(t;0,d01>]  dt 


_  '  <tt  ♦  «f)‘<it 

..  *01™  0  c  c 

-  U«  — -  ~T  T -  <*• 

rt2dt 

-  t 
0 

The  probability  of  wrong  detection  In  en  Interval  of  length  T,Pqj(T),  Is 


given  by  (see  (10,12)) 


P0l(T)  -  erfc( 


<"<>■'“>  .  vEr 
^  2 


where  v Is  the  normalised  energy  of  the  signal  for  detection,  and  Is 
easily  bounded  for  our  sequential  scheme  by 

^  *  v'1  J  (g°(0)  ♦  g* (0) )*dt 
0 


v‘l  r  ♦  g!>2dt 


(* 


Therefore,  che  asymptotic  performance  of  our  scheme  in  terms  of  the 
normalized  error  variance  is,  from  Eqs.  (4.34)  to  (4.47) 


/r-  («: 


^  Var<eJ)  -  I  lia  «rfc(^^  +  -J1  ]  % 
3  T  ■*  ®  .  E_  2  P  , 


ec)2dt 


«!  dt 


(4.48) 


Note  that  in  arriving  at  Eq.  (4.48),  we  have  invoked  no  assumption  on  the 
stability  of  the  system.  For  stable  system  where  the  system  matrix  has  all 
the  poles  on  the  left-half  plane,  or  in  the  scalar  case  a  <  0,  all  the 
quantities  on  the  right  side  of  Eq.  (4.48)  can  be  shown  to  be  bounded  and 
hence  che  asymptotic  limit  as  T  -•  *  always  exist.  Since  if  a  <  0, 

•  v 

j  C2  dt  <  •  ;  erfc(^n^°)  «•  — ]  <  •  ;  vl~  <  •  (4.49) 

0  v*. 


Now,  at  first  glance  che  term 

m 

♦  «,)2de 

i  1  e 

appears  to  be  possibly  unbounded,  since  ef  (Eq.  (4.4»2)  contains  an  exponential 

term  exp{(a  ♦  B2QP‘'*l)t).  which  can  still  be  unstable,  i.e.  a  ♦  B2QPC"1  >  0. 

However,  this  is  the  eigenvalue  of  che  non-causal  smoother  through  which  e( 

2  c  *  1 

is  generated.  Here,  even  for  the  case  a  t  B  QP  >  0,  ii  still  always 

bounded  due  to  the  end-point  condition,  j^T  •  of  the  smoother.  For 

unstable  system,  a  >  0.  and  the  limit  of  the  error  variance  becomes 

T  , 

J7T  !  C c  +  V  dt 

lim  Var(C")  -  ^  ltm  erfc  «.  -~I  J  - - 

v-o  t-  itiit 

*  c 
0 


(4.50) 
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Using  an  exponential  bound  for  erfc(.)  and  substituting  £q.  (4.47)  into 
(4.50),  we  have 


•I  T 


lim  Var(t”)  <  ^  iim  exp(-  '  (t  +g*)2dt) 

X.  a  t  <•*  —  O  v  C  C 

•♦0  T  o 


(C£  +«c)  dc 


T  , 
f  e2dt 

«  t 

o 


Application  of  the  L'apital's  rule  for  the  limit  consecutively  yields 

a  >2 

lim  Var(«")  <  lim  - f— -  -  0  (4.51) 

C  —  3  ,  -IT 


A.-*  0 


T--  -2 


ct  exp(T~  -  +  8tl)"dt) 


The  result  in  £q.  (4.51)  is  not  too  surprising  because  in  detecting  a 
signal  with  increasing  power  over  a  semi- inf lnite  Interval,  the  signal-to- 
noise  ratio  goes  to  infinity  rendering  correct  detection  and  subsequently 
precise  estimation  with  probability  one. 

Hence  in  our  sequential  approximation  scheme,  for  unstable  systems, 
it  is  better  to  keep  estimating  all  the  previous  incidents.  It  should  be 
noted  chat  in  Approximation  II,  Che  error  resulting  from  the  finalized 
estlauites  of  Che  incidents  will  propagate  thus  degrading  the  performance 
of  Che  scheme. 

An  equalizing  approach  to  eliminate  the  growing  error  propagation 
is  developed  in  the  next  section.  With  this  additional  procedure,  the 
instability  of  the  system  in  many  cases  while  not  giving  rise  to  a 
unstable  error  propagation  from  the  finalized  estimates  can  in  effect 
Improve  the  estimation  and  detection  performance  if  the  system  state 
contains  sufficiently  large  energy  in  the  estimation  and  detection  interval. 
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4 . 5  The  Error  Propagation  and  Compensation  Approach 

In  this  section,  we  examine  the  effect  of  the  error  due  to  the 
deviation  of  the  finalized  estimates  of  the  incidents  on  subsequent 
estimations  and  detections  in  the  sequential  scheme.  Due  to  the  sequential 
structure  of  the  scheme,  the  error  arising  from  the  finalized  estimates 
will  exhibit  a  propagation  effect  on  latter  estimates  and  detections, 
degrading  the  overall  performance  of  the  scheme.  This  is  especially  the 
case  when  the  approximations  (I)  and  (II)  of  Section  4.2  are  adopted. 

For  an  unstable  system,  the  errors  individually  grow  with  time  and 
tend  to  accumulate  and  propagate  through  the  subsequent  intervals.  For 
stable  systems,  a  single  error  tends  to  fade  out  with  time.  But  the 
propagation  effect  due  to  it  is  quite  nominal  to  continue  for  a  relatively 
longer  duration  because  of  its  generating  phenomenon.  If  the  time  for 
the  resultant  propagation  effect  to  fade  is  long  enough,  the  errors  will 
accumulate  and  reach  a  perpetual  propagation  effect. 

Although  our  approximate  expression  for  the  asymptotic  performance  for 
X  -  0*  derived  in  section  4.4  is  quite  acceptable,  it  is  intuitively  clear 
that  the  error  propagation  effect  will  drastically  degrade  the  performance 
as  \  Increases  from  0. 

Here  we  propose  a  rather  simple  and  robust  approach  to  overcome  Che 
effect  sequentially  by  a  compensating  scheme.  More  precisely,  it  will 
estimate  the  error  due  to  the  previous  finalized  estimate  over  the  interval 
over  which  the  estimation  and  detection  of  the  incidents  are  being  processed. 
By  taking  away  the  errors  due  Co  the  finalized  estimates  sequentially,  the 
propagation  effect  is  eliminated,  and  sequentially  we  have  a  new  smoothing 
problem  over  a  new  single  interval  of  length  IU  each  time. 


To  investigate  the  error  propagating  pattern  in  the  SSS,  let  us 


assume  that  we  have  Just  applied  the  scheme  over  the  interval  (0,KA)  and 


obtained  the  estimates  ',u  and  the  estimate  of  the  system  state  based  on 


the  observations  over  (O.KA).  Now  we  finalize  the  incident  over  (0,A) 


(assumed  exists)  and  take  in  a  new  subinterval  of  observation  of  length 


A;  apply  the  scheme  and  obtain  another  set  of  estimates  ^,u  over  the  new 


It  should  be  clear  that  even  if  we  consider  the  entire  interval 


the  estimates  of  the  incidents  and  their  marks 


should  be  more  or  less  as  ^,u,  especially  the  ones  occurred  at  earlier 


time.  In  a  lot  of  cases,  the  estimates  of  the  early  incidents  and  marks 


are  identically  the  same,  since  the  additional  subinterval  of  observation 


of  length  A  is  unlikely  to  improve  those  estimates.  Hence  it  is  quite  safe 


to  assume  for  all  practical  purpose  that  over  the  subinterval  (A,2A) 


Indeed  from  Eq.  (4.21) 


see  that  (▼ ,u)  has  to  satisfy  the  following 


In  particular,  we  have  from  causality  of  g 


are  two  constants  and  should  be  closed  to  zero.  Similarly 


denoting  the  estimates  of  incident  in  (4,24) 


where  5.  and  5  are  two  constants  close  to  zero.  But  z  -  z  *8,  (▼!*“]) 


Hence  we  have 


Comparing  Eq.  (4.54)  and  Eq.  (4.56),  we  assume  that  '  ,u  are  close  to 


t2,u2  for  all  practical  purposes. 

Note  that  the  two  sets  of  estimates  of  incidents  and  marks  over  the 


subinterval  (4,24)  are  very  close  does  not  imply  that  we  do  not  have  errors 
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in  che  estimates.  Ic  simply  says  chat  over  Che  incervals  (0,KA)  and 
[A, (K -f l)A) ,  Che  MAP  estimates  of  Che  incidenCs  and  marks  over  (A,2A)  are 
very  close.  Accually,  Che  uncercainCy  of  Che  incidenCs  and  marks  is 
discribuced  in  an  MAP  fashion  over  Che  escimaCes  of  Che  incidenCs  in  Che 
encire  incerval.  Finallz. ig  Che  firsc  estimate  or  not  does  noc  affect 
very  much  Che  escimace  of  che  incidenC  righc  afcer  ic  since  chey  have  been 
escimaced  together  and  shared  Che  uncercainCy  accordingly. 

Hence  finalization  of  escimaces  of  incidenCs  and  marks  does  noc  affecC 
much  che  subsequenC  estimation  and  dececcion  of  incidenC  righc  afcer  ic 
provided  K  is  reasonably  large.  Neverche less ,  the  finalization  of  estimates 
of  an  incident  does  introduce  an  error  function  into  Che  subsequenC  sub- 
incervals.  Wich  che  reconstruction  scheme  of  using  a  filter  (4.12)  and  a 
smoother  (4.19),  Che  resultanC  error  can  be  sufficiently  large  ChaC  should 
noc  be  neglecced  wich  che  consideration  Chat  ic  would  propagate  and 
perpecuace  into  che  subsequenC  Incervals. 

Unlike  che  case  ChaC  Che  finalized  escimaces  does  not  affect  coo  much 
of  che  dececcion  and  estimation  of  its  immediately  subsequenC  incident, 

Che  error  arising  from  a  finalized  escimace  of  an  IncidenC  affects  most 
Che  rec one c rue cion  of  Che  system  state  over  its  Immediately  subsequenC  sub- 
interval  in  che  SSS.  In  che  reconstruction  process,  che  smoother  (4.19)  can 
be  regarded  as  a  filter  driven  backward  in  time  over  (A,KA)  by  Che  filter 
(4.12)  output,  which  in  turn  is  driven  by  the  encire  error  function  over 
(&,K&).  Hence,  che  error  over  [&,2A)  is  a  result  of  a  cascaded  syscem  of 
filcers,  boch  of  which  in  effecc  are  driven  by  che  error  function  arising 
from  che  finalized  escimace  through  a  t tme- Interval  of  [A,KA). 


0  1 

Notice  that  wc(tq,U0^  *nd  wt^T0'u(P  *re  bot^  zero  identically  if 

rt(To,uo)  is  £ero* 

4.5.2  Compensation  Approach  to  Eliminate  Error  Propagation 
In  this  section  we  develop  a  robust  compensation  scheme  to  eliminate 
the  error  propagation  effect  in  the  SSS.  Based  on  the  results  in  4.41 
about  the  error  patterns,  we  like  to  assess  the  error  due  to  the  finalized 
estimate  of  the  incident  in  a  sequential  manner. 

Let  us  denote  the  reconstructed  estimates  of  the  system  state  X(  and 
over  (4,24)  respectively  from  the  two  sets  of  Incident  estimates  (t,u) 
and  <£,u).  Hence  from  previous  discussions  and  results  (4.52)  (4.58)  and 
(4.59), 

(XC  -  \)  -  w°(Vu0)  -  wtl(Vu0)  4  <  t  <  24  (4.60) 

*  n 

Note  that  ,  X^  are  Just  the  sequential  reconstructed  system  states,  and 
are  most  readily  available  in  the  SSS  process.  Equation  (4.60)  simply  says 
by  comparing  the  reconstructed  system  state  over  a  subinterval  sequentially, 
we  can  approximately  compute  the  error  due  to  the  finalized  estimates 
sequentially. 

It  should  be  noted  that  In  arriving  Eq.  (4.60),  ve  have  only  assumed 
that  an  additional  subinterval  observation  of  length  4  cannot  Improve  much 
about  the  estimates  In  remotely  early  sub  Intervals.  The  Information  of  the 
error  arising  from  the  finalised  estimate  is  gained  by  the  sequential  re* 
construction  process;  it  is  over  the  same  additional  subinterval  of  length 
4  that  the  information  is  gained,  but  that  subinterval  is  about  2K4  afar. 


For  unstable  systems  in  which  case  error  tends  to  grow  with  time,  a  small 
Interval  of  observations  will  give  considerable  information  when  is  far 


74 


For  our  compensation  purposes,  let  us  consider  the  first  order  effect 
compensation  only  (higher  order  compensation  can  easily  be  extended).  Since 
by  definition,  the  error  function 

rt(To,uo)  *  *t(,,’u>  '  <4‘6l) 

which  can  be  expressed  as  follows 

reCT0-uo)  "  t*t*To,wo*  ‘  »?<Vo” 

*  «t<W1(lTo>  *  °<‘V 

‘^•tVvuS1*0^  <4-62) 

Putting  Eq.  (4.62)  into  Eq.  (4.58),  (4.59)  and  (4.60)  we  have  a  relation¬ 
ship  which  can  be  algebraically  represented  as 

<XC  -  Xt>  *  VVUqHAt)  ♦  rrQ, u0)(Au0)  (4.63) 

where  w^  and  w^  are  the  appropriate  expressions.  A  simple  computation 
method  to  evaluate  (AT)  and  (Au)  la 

2A  2A  lb 

f  (Xt  -  *  )dc  -  (AT)  J  ^r0.u0)dt  ♦  (Au)j*  wt(T0,u0)dt  (4.6<*a) 

A  A  A 

and  similarly  we  have  -•> 

J  <\  -  -  <^>2  J  C^0.u0)dt 

A  A 

lb 

♦  (A*)(Au)  r  vtC0^0)  «<r0*u0)dt 

A 

7  2A_5 

♦  (Au)  J  w£  (r0,u0)dt  (4.64b) 
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The  cwo  Eqs.  (4.64*)  and  (4.64b)  c*n  determine  the  approximate  values  of 
(&?)  and  (Au)  and  hence  the  error  function  from  the  finalised  estimate 

(VV* 

To  sussaarize  the  compensation  approach,  we  have  developed  a  scheme  to 
learn  about  the  error  functions  from  the  finalized  estimates  of  incidents 
sequentially.  The  learning  is  done  through  the  structure  of  the  re¬ 
construction  process  which  utilizes  in  effect  an  interval  of  2145  long. 

By  compensating  the  error  function  approximately  in  a  sequential  fashion  in 
Che  SSS.  Che  error  propagating  effect  can  be  eliminated  or  at  least 
dimenished  considerably.  For  unstable  systems,  a  length  of  2IG  is  sufficient 
to  give  a  lot  of  information  about  the  error  functions,  one  would  deduce 
chat  the  compensation  approach  works  even  better  than  it  would  in  a  stable 
system. 

4.6  Performance  Simulations  of  the  SSS 

The  sequential  scheme  was  slsulated  on  a  digital  computer  to  obtain 
samples  of  its  performance  in  the  state  estimation  problem.  In  each  case, 
the  simulation  was  done  by  first  allowing  the  scheme  to  run  for  a  time 
interval  to  include  20  incidents.  Then  the  performance  was  computed  over 
the  subsequent  Interval  of  duration  30/X ,  which  on  the  average  would  allow 
about  30  Poisson  Incidents.  For  comparison  purposes,  the  time  scale  were 
chosen  so  that  in  each  case  X  was  equal  to  one.  We  chose  a  «  0.15,  and  the 
length  for  the  Integral  (K+M)  *  4.  The  system  state  was  reconstructed  with 
delay  of  2. 

In  each  example,  the  sample  performance  will  be  presented  In  form  of 
three  graphs  plotter  over  a  time  Interval  of  length  30.  The  first  graph 


will  indicate  the  normalized  error  variance  derived  analytically  i.e. 

Var(«”).  The  second  graph  will  give  the  simulated  performance  in  terms  of 
the  normalized  error  variance  of  the  SSS  without  the  compensation  strategy. 
The  third  will  give  that  simulated  performance  of  the  SSS  with  the  compen- 
sation  strategy. 

Several  examples  were  considered  as  follows: 

Example  I:  x(t)  -  -2x(t)  +  2Wc  did);  x(0)  ■  0 

y(C)  -  x(t)  ♦  v(C) 

The  mark  U  is  assumed  to  be  Gaussian  with  variance  S  and  mean  ■  (a)  5  (b)  7. 
(See  Figs.  8a  and  8b.) 

Example  II:  i(C)  •  2x(C)  ♦  2W.  *  ;  x(0)  -  0 

y(C)  -  x ( t )  *  v(t) 

The  mark  U  is  assumed  to  be  Caussian  with  variance  5  and  mean  ■  (a)  5  (b)  7. 
(See  Figs.  9a  and  9b.) 

Example  III:  x(t)  -  3x(C)  ♦  W(t)  ♦  "H (t ) ;  x(0)  -  0 

y(t)  -  x (t )  «■  v(t) 

(For  results,  see  Fig.  10.) 

From  the  simulation  study,  the  results  indicated  that  the  sequential 
approximation  scheme  did  exhibit  an  error  propagation  phenomenon  both  for 
stable  and  unstable  systems.  In  cases  of  unstable  systems  as  Example  II 
and  111  had  indicated,  the  propagating  errors  could  build  up  to  great 
magnitudes  severely  degrading  the  overall  performance  of  the  SSS.  (See 
Figs.  9a,  9b  and  10.)  The  results  In  all  the  examples  confirmed  the  ability 
of  eliminating  most  of  the  propagating  errors  of  the  compensation  strategy. 

In  the  case  of  an  unstable  system,  the  adoption  of  this  compensation  approach 


^  Uur*  9t»«  Simulated  performance  of  the  SSS  with  and  without  the  compensation 
scheme  in  comparison  to  the  analytically  derived  performance 
(Example  A  -  I  lb  ) . 
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could  la  effect  completely  eliminate  the  error  propagation  effect  despite 
chat  the  compensation  was  only  a  first  order  approximac Ion  scheme. 

U . 7  Cone lua Ion 

In  this  chapter,  we  developed  a  subopt Imal  sequential  scheme  for  Che 
estimation  problem  of  che  Poisson-Wiener  driven  systems.  It  was  a  similar 
scheme  to  that  developed  In  Chapter  3  for  the  purely  Poisson  driven  system. 
The  SSS  differed  from  the  previous  scheme  only  in  that  the  reconstruction 
was  performed  on  a  MMSE  optimal  smoother  while  in  Chapter  3  the  recon¬ 
struction  was  deterministic  sfter  che  esclmate  of  the  Poisson  process  was 
determined.  Here  we  also  Included  a  first  order  approximation  strategy 
in  the  sequential  scheme  to  suppress  che  error  propagation  effect  that 
stemmed  from  che  finalized  estimates  of  incidents. 

In  the  numerical  simulation  study,  the  ability  of  nullifying  the 
propagating  errors  in  che  sequential  scheme  in  the  compensation  approach 
was  confirmed.  Hence  with  the  compensation  strategy  included,  the  SSS 
would  perform  well  as  the  analytic  results  predicted. 
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CHAPTER  5 

EXTENSION  OF  THE  SUBOPTIMAL  SEQUENTIAL  SCHEME  (SSS) 

As  it  has  been  indicated  in  Che  derivations  of  the  SSS  in  previous 
chapters,  Che  Ray  requirement  for  applicability  of  the  scheme  is  the 
existence  of  a  closed  (finite  dimensional)  MMSE  optimal  filter  for  the 
pseudo-est iaates  of  the  system  state  conditional  on  the  a-fleid  generated 
by  the  Poisson-marked  process.  Furthermore,  Che  corresponding  likelihood 
function  for  Che  Poisson-marked  process  must  exist,  so  chat  the  detections 
and  estimations  of  the  process  can  be  performed  using  the  MAP  criterion. 

In  this  chapter,  we  shall  consider  the  state  estimation  problem  for 

a  class  of  systems,  namely  the  class  of  conditional  Gaussian  systems, 

which  readily  satisfy  the  two  requirements  above.  As  we  have  developed 
the  SSS  to  quite  an  extent  in  previous  two  chapters,  here  we  derive  only 

the  closed  filters  for  the  pseudo-estimates  together  with  an  expression 

for  the  likelihood  function  for  this  class  of  systems.  It  should  be 
obvious  that  with  these  results,  the  SSS  for  the  estimation  problem  can 
be  readily  obtained  using  the  similar  approach  as  before. 

In  order  Co  state  Che  system  in  precise  terms,  the  notation  x  -  (x^,^) 
will  be  used  to  denote  the  random  process  x^.  (<i>) ,  measurable  with  respect 
to  Let  <f.,5,P)  be  the  (complete)  probability  space  as  given,  with  a 

nondecreasing  right  continuous  family  of  sub-c-algebras  (?t),  0  <  t  <  T. 

The  system  considered  in  this  chapter  can  be  represented  by 

2 

d*c  -  {^(tAy)  +  <?2(t,y)xc)dt 5l(t,y)dWl(t)  (5.1) 


♦  3Q(  t,U)d*,,t 
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and  is  observed  through  the  following  process, 

2 

dyc  -  (^(C.y)  +  <i2(t,y)xc]dt  +  E  «t(t.y)dWt(t)  (5.2) 

Here  ^(t.5),  3t(t,5),  ^(t.5),  1-1,2  are  (measurable)  functionals, 

assumed  to  be  non-antlcipatlve  (i.e.  measurable  where  is  the  a-algebra 
In  the  space  of  the  continuous  functions  £  -  [5#,s<t},  generated  by  the 
functions  5  ,  s<t).  And3g(t,U)  is  assumed  to  be  a  measurable  function  In 
tin*  and  the  mark  U  driven  by  the  Poisson-marked  process  ",  -  ("t(U),5c). 

It  should  be  clear  that  the  random  process  1)  Is  characterized  by  the  Poisson 
incident  times  r  and  the  corresponding  mark  u.  Note  that  the  system  (5.1) 
to  be  estimated  can  depend  on  the  entire  history  of  the  observed  process. 

In  order  for  the  system  given  in  (5.1)  to  be  conditional  Gaussian,  the 
following  relations  have  to  be  satisfied  (7,8,22,23], 

(1)  for  each  C^,.  with  probability  one 


T  2 


f(Ztkt(C,5)|  ♦  |Ct(t,?)|  +  ^(t.S)  +*2(t,S)]  +32(t,U))dc  <  -  (5.3) 

b  l-l 


T  2 

(2)  (  Z  St(t.S)]  dt  < 

l-l 


(5.4) 


Inf  A.(C.S)  >  C  »  0 
c  r  1  1 


S€C. 


1  -  1,2 

0  <  t  <  T 


(5.5) 


(3)  For  any  1-1,2 

l^t(t: ,5)  -  ^(t,;)!2  <  J  |$t  - cgl 2dK(*)  ♦  L2Ut  -;tl2 

0 


(5.6) 
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fi 


*|<e.5)  <  J  (i+S‘>dK(s)  +  1^(1  +  ^) 
0 


(5.7) 


where  K(s)  is  e  nondecreasing  right  continuous  function,  0  <  K(s)  <  l. 


J*  E|C2(t.5)*t!dt  <  - 
0 


ElxJ  < 


0  <  t  <  T 


pi  J  e2(c,s)»^t  <  -}  -  i 

o 


where  mc  - 


(5.8) 


(5.9) 


(5.10a) 


(5.10b) 


In  Eqs .  (5.1)  and  (5.2)  we  also  assume  that  W^  •  (W^(t),5c),  •  (W'2(t),5t) 

are  Independent  Wiener  processes  and  the  random  initial  conditions  x^  and 
yQ  are  independent  of  and  W2» 

The  system  with  the  observations  satisfying  (5.1)  and  (5.2)  is 
conditionally  Gaussian  if  with  probability  one  the  conditional  distribution 
Fy  (a)  -  P(xQ  <  a|yQ)  is  Gaussian.  That  is, 

'r.n'Vi  —  V  •  f<v  i  V  ”  *c  i  •.!»?•»?>  <5-“> 

o  n 

is  Gaussian  almost  surely,  (7,8,22,23]  for  any  t  and  0  <  t-  <  t,...t  <  t. 

The  class  of  conditional  Gaussian  processes  similar  to  chat  given  in 
(5.1)  without  the  presence  of  the  Poisson  process  71  ■  (71^,5^.)  was  first 
formulated  and  investigated  by  Liptser  [22]  and  subsequently  was  used  for 
Che  optimal  nonlinear  filtering  problems  [7,8,22].  For  this  class  of 
processes,  the  >*1SE  optimal  filters  were  closed  due  to  Che  conditional 
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Gauss  idleness  of  che  processes.  (See  Appendix  D.)  Ic  should  be  emphasized 
chac  che  Kalman  filcer  is  an  opcimal  fllCer  of  a  parcicular  case  of  (S.l) 
when  ^  is  absent.  When  the  system  processes  are  Gaussian,  the  optimal 
filcer  as  a  resulc  is  linear,  however  in  the  conditional iy  Gaussian  case 
Che  opcimal  filcer  is,  In  general,  nonlinear  [23*26]. 

Using  che  opcimal  filters  derived  for  Chis  class  of  conditional  Gaussian 
processes,  the  pseudo-estimate  for  x  ■  0  <  c  <  T  and  its  conditional 

variance  denoted  respectively  by  che  m^  and  Pt  can  be  easily  shown  to 
satisfy  Che  following,  (Appendix  D) 

«t  -  ttP  m^  (5.12) 


such  chat 


0  o  9 i(t.y>* ,(t,y)+-5 ,(t,y)* ,(t,y)  +  P  C  (t ,y) 

-  (<7.(t.Tl.y)+(7  (t.y)m°]dt  ♦  (  -± - ^ - f - * - - - 1 

C  1  C  ^(e,y)+*‘(t.y) 


(dyt  -^(t.Tl.y)  ♦  fl.2(t,11)«°)  -  &L(c,y)0o(t,O)d1)e] 


(5.13a) 


and 


2^2(t,y)Pc  +5^(t,y)  ^(C,y)  -  (— - 

i2(t,y) 

— ) 

(5.13b) 

initial  condidons 

“o  *  E<x0lyo) 

(5.14a) 

p0  -  e((x0  -  ®0)2|y0J 

(5.14b) 

,  t 

®c  *  J  *(*.•)  ^0(*.u)<rr)> 

(5.15) 
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where  <l^(t,s)  is  the  state  transition  function  of  the  system  (5.1)  given 
the  observations  {yv;  v<t}.  ♦in  this  case  is  a  functional  of  y  ■  (yt>?t) 
since  from  Eq.  (5.1),  the  system  function^  is  •  functional  of  y. 

Hence  we  have  a  finite  dimensional  !”MSE  optimal  filter  for  the  pseudo- 
estimate  of  x  -  (xc>^c)  conditional  on  the  o-algebra  generated  by  T|  •  (“H  ,5t). 
In  order  to  derive  the  equations  for  the  detection  and  estimation  of  the 
Poisson  incidents  and  their  corresponding  marks,  we  shall  obtain  the 
likelihood  function  for  {Tc‘.  0  <  t  <  t)  .  It  should  be  obvious  that  once 
this  is  obtained,  the  incidents  and  their  marks  can  be  substituted  ini 
the  function  in  the  place  of  (T  ;  0  <  t  <  t}  ,  so  that  an  expression  for  the 
aposterlorl  probability  of  the  incidents  and  marks  can  be  obtained.  Hence 
from  now  on,  the  equations  containing  i  are  understood  to  be  functions  of 
the  Poisson  incidents  and  marks. 


Applying  the  estimator-corre lator  principle,  the  likelihood  function 
for  0  <  t  <  T}  ,  denoted  byA^l^l)  can  be  obtained  and  is  given  by 

(22  ,  23],  (see  Appendix  D  also) 


atpij 


expi  -  j  l 
0 


T  ^(a.y)  +  £2(*,y)»#(y,T]) 


^(s,y)  ♦  «2<s,y) 


.  T  (C-^s.y)  ♦  G2(s.y)mt(y,Tl)]2 

T  ,  j  ^  *  < 

0  *J(s,y)  +fi2(s,y) 


(5.16) 


where  we  denote  the  dependence  of  n>t  on  y,T|  by  mt(y,T)).  Hence  an  equivalent 
expression  for  the  aposterlorl  probability  density  for  0  <  t  <  t} 

be  obtained  and  is  given  by 


can 
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T  Cj(8.y)  m  (y,T1) 

j  n]  .  J  .  -i - !_ -  dy> 

r  o  *  ^a.y)  +  ^2(*,y) 

T  »,(*.yXJ,<*,y)».(y,i))  P.(».y)»  <y,1)l!  , 

♦  J  t  -5 - - 1 - *  -? - *-j -  )  *• 

0  £7(8. y)  +  *2(*,y)  ®^(*.y)  +  *2**,y) 

+  in  Cxcn)  (5.17) 

where  we  denote  the  eprlori  probability  density  of  {i",  ;  0  <  t  <  t)  by 
f^C.).  Note  that  for  our  case  f^f)  is  a  well  defined  probability  density, 
since  1  1*  *  separable  process  (27).  In  fact,  when  {T^;  0  <  t  <  t}  Is 
explicitly  expressed  in  terms  of  Che  incidents  and  marks,  f^(")  is  Just 
a  function  of  the  Joint  probability  density  of  Che  marks  and  the  joint 
probability  density  of  the  incidents  over  (0,T). 

Similarly  as  in  Chapters  3  and  4,  the  maximization  of  will  give 

the  HA?  estimates  of  Che  incidents  and  their  marks.  For  the  reconstruction 
of  Che  system,  the  optimal  MUSE  smoother  will  be  used  and  can  be  repre¬ 
sented  as  follows  [22,23]  (see  also  Appendix  D) 

dtmt/T  "  *  ^(t.yln^/iJdt 

♦  ^(C.y)?*1^  -  ®t/T)dt 

♦  (S  *£)(t,y)0£  *£)“l(t,y){dyt  -  (C^t.y)  ♦C-l(t,y)mt/T)dt]  (5.18a) 

Pe/T  -  2(ff(C,y)  ♦  ^(t,y)P*l]Pt/T  *  -S(t,y)  (5.18b) 


®T/T  •  "V 


(5.19b) 


I 

0 

0 

II 
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Here  Che  notations  we  use  arc  as  follows 

3*6(  t.y)  -  ^(t.y)®  ^C.y)  +  ^  (t  ,y)*2  (C  ,y)  (5.20) 

and  . 

~(t.y)  -^(c.y)  -  *«)’l(t,y)C1(t.y)  (5.21) 

5(t.y)  *  3  *3  (c.y)  -  (5  *A)2  (t  ,y)  <*  *& )'l(c  ,y)  (5.22) 


The  reconstruct  ion  of  Che  system  state  x 


x 


t/T 


0  ^  l 

*t  /T  +  “t 


(xt»5c)  can  be  achieved  by 

(5.23) 


where  Is  given  In  Eq.  (5.15).  , 

To  susevarlze  this  chapter,  we  have  extended  our  original  estimation 
problem  to  a  general  class  of  conditional  Gaussian  processes.  As  a  result, 
the  SSS  can  be  similarly  developed  as  a  near-optimal  scheme  to  the  problem. 
Here  we  have  derived  all  the  basic  equations,  expressions  for  the  combined 
estimation  detection  scheme.  The  sequential  subopt laal  scheme  (SSS)  can 
be  easily  derived  based  on  these  equations  and  expressions.  While  this  Is 
conceptually  straightforward  and  highly  parallel  to  the  algorithm  In  Chapter 
4,  the  detail  development  of  Che  sequentlel  scheme  Is  omitted  here.  The 
procedures  and  equations  In  the  SSS  are  similar  to  chose  In  Chapter  4. 

In  fact  the  estimation  problems  In  Chapter  3  and  4  are  particular  cases  of 


the  one  considered  here 


CHAPTER  6 


SUMMARY  AND  CONCLUSIONS 

In  this  thesis,  the  problem  of  state  estimation  for  the  linear 
Poisson-Uiener  driven  systems  with  low  Poisson  intensity  was  considered. 

The  >«SE  optimal  filter  was  first  obtained  via  the  Dolean-Dade  and  Meyer 
differentiation  rule  for  discontinuous  semi-martingales,  and  the  basic 
filtering  theorem  for  white  Gaussian  observation  noise.  The  optimal  filter 
obtained  was  not  closed  in  the  sense  chat  it  was  the  solution  of  a  set  of 
an  infinite  number  of  stochastic  differential  equations,  unlike  the  Gaussian 
case  where  an  additional  relationship  of  its  moments  reduced  the  set  of 
equations  to  a  finite  dimensional  set, result ing  in  the  classical  Kalman 
filter. 

The  none losedness  of  the  optimal  filter  for  our  problem  was  proved 
with  the  following  two  arguments: 

(1)  If  a  similar  relationship  existed  to  reduce  the  infinite  set 
of  equations  to  a  finite  set,  then  the  solutions  to  the  set  of  equations 
were  guaranteed  to  exist  and  furthermore  were  unique,  hence  the  process 
I  -  defined  bydlt«dyt  -  hcCltdt  was  an  innovation  process  and 

therefore  a  Wiener  process.  Therefore  the  estimator  of  the  system  state 
being  the  solution  to  a  finite  set  of  equations  driven  by  a  Wiener  process 
would  have  to  be  continuous  with  probability  one.  Since  the  system  being 
estimated  was  discontinuous  with  non-zero  probability,  the  optimality  of 
the  estimator,  was  contradicted.  We  hence  concluded  that  there  was  no 
such  relationship  that  would  reduce  the  infinite  set  of  equations  to  a 
finite  set,  so  that  Che  optimal  filter  was  not  finite  dimensional. 
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(2)  By  Che  definition  of  an  innovation  process,  the  information 
generated  by  it  had  Co  be  equal  to  Chat  generated  by  the  observations. 

That  is  to  say,  the  two  c-algebras  generated  by  Che  innovstion  process 
and  the  observations  individually  had  to  be  equal  for  all  t,  0  <  t  <  «. 
However  y  •  (yt>^t)  being  an  observation  process  of  a  discontinuous  system 
in  white  Gaussian  noise  was  dlscont inuous  with  non-zero  probability, 
implying  that  9^  was  only  right -continuous  with  non-zero  probability.  In 
order  for  a  ^-measurable  process  to  be  an  innovation  process,  its  a- 

y 

algebra  was  necessarily  equal  to  and  hence  left  discontinuous  with  non¬ 
zero  probability.  This  contradicted  the  intrinsic  property  of  an  innovation 
process  being  a  Wiener  process  having  all  the  stochastic  properties  as  the 
white  Gaussian  observation  noise  end  its  7-algebra  was  continuous  with 
probability  one.  Therefore,  for  our  estimation  problem,  there  was  no  9^- 
measurable  innovation  process,  which  explained  why  the  optimal  filter 
driven  by  Che  process  I  performed  so  poorly.  The  definition  of  I  implies 
Chat  It  is  not  an  innovation  process  only  if  the  estimator  does  not 
exist  in  Che  strong  sense  as  a  solution  to  the  set  of  equations.  Hence 
Che  set  of  equations  must  be  infinite,  otherwise  would  exist  as  a 
strong  solution. 

Therefore,  the  optimal  filter  was  physically  un imp  lamentable.  Never¬ 
theless  its  performance  was  investigated  and  compared  to  that  of  the  linear 
optimal  filter,  the  Kalman  filter.  When  the  Poisson  intensity  was  low, 
the  improvement  over  the  Kalman  filter  was  negligibly  small.  Due  to  Che 
fact  that  the  linear  optimal  schemes  only  utilized  the  first  and  second 
order  statistics  its  performance  was  quite  poor.  With  the  result  that  the 
optimal  filter  performance  was  still  unacceptable,  we  therefore  concluded 
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that  causal  filtering  was  inherently  unsuitable  for  the  state  estimation 
of  the  Wiener-Poisson  driven  systems.  Since  the  linear  optimal  non-causal 
filter  also  exhibited  poor  performance,  and  the  optimal  non-causal  schemes 
were  physically  unimplementable ,  we  investigated  nonlinear,  noncausal 
suboptlmal  schemes. 

In  this  work,  a  sequential  suboptlmal  scheme  (SSS)  to  the  estimation 
problem  was  developed.  Basically  it  detected  and  estimated  the  Poisson 
driving  process  using  an  MAP  criterion,  and  then  reconstructed  the  entire 
system  state  using  a  MMSE  optimal  noncausal  filter.  In  the  numerical 
procedures  for  optimization,  the  SSS  utilized  numerical  algorithms  that 
did  not  require  differentiation  of  functions  containing  white  processes 
and  hence  avoided  the  problem  of  ill-defined  derivatives. 

In  Chapter  3,  the  estimation  problem  for  the  purely  Poisson  driven 
system  was  considered.  The  SSS  was  developed  as  a  sequential  suboptlmal 
scheme.  The  asymptotic  performance  of  the  SSS  as  tns  Poisson  intensity 
goes  to  zero  was  analytically  derived  and  numerically  simulated  on  a 
digital  computer.  Both  results  agreed  extremely  closely  indicating  that 
the  SSS  performs  very  well  and  that  there  exists  a  V  0,  such  that  for 
the  Poisson  intensity  X.  €  (0,A*J  the  SSS  performs  better  than  any  causal 
filtering  schemes  and  any  non-causal  linear  schemes. 

In  Chapter  4,  the  SSS  was  developed  for  the  Poisson-Wiener  driven 
system.  A  sequential  compensation  scheme  was  Included  in  the  scheme  to 
eliminate  the  error  propagation  effects  which  arose  due  to  the  sequential 
structure  of  the  approach.  It  was  found  that  the  propagating  errors  which 
stemmed  from  the  finalized  estimates  of  Che  Poisson  driving  process  did 
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not  strongly  affect  the  inmediate  subsequent  detection  and  estimation 
of  incidents,  but  affected  to  a  great  extent  the  performance  of  the 
reconstruction  of  the  entire  system  state.  The  sequential  compensation 
strategy  consisted  of  sequentially  comparing  a  subinterval  of  the  re¬ 
constructed  state  of  the  system  to  assess  the  propagating  error  and  then 
compensating  it  with  a  first  order  approximation  scheme  to  eliminate  the 
propagating  errors.  In  a  numerical  simulation  study,  the  SSS  was  shown 
to  perform  basically  as  predicted  by  the  analytical  results.  With  the 
compensation  strategy  added,  the  overall  performance  improved  in  general 
very  little,  only  that  the  error  propagation  effect  was  mostly  eliminated. 

In  Chapter  5,  we  extended  the  state  estimation  problem  of  Che  original 
systems  to  a  larger  class  of  systems,  namely  a  class  of  conditional  Gaussian 
systems.  This  class  Included  a  wide  variety  of  physical  systems  and 
random  phenomena.  However  each  member  of  this  class  gave  rise  to  a  MMSE 
optimal  filter  of  finite  dimensions  for  its  pseudo-estimates,  i.e.  the 
estimates  of  the  syscem  state  conditional  on  the  information  of  the  under¬ 
lying  Poisson  driving  process.  This  property  enabled  us  to  apply  the 
principle  of  the  SSS  to  the  estimation  problem.  For  the  Poisson-Wiener 
driven  linear  system,  which  was  a  particular  case  of  the  conditional 
Gaussian  systems,  the  MMSE  optimal  filters  for  its  pseudo-estimates  were 
linear.  However,  in  general  the  optimal  filters  for  such  class  of 

systems  were  non-linear. 

For  future  considerations ,  the  estimation  problem  is  to  extend  to 
larger-class  of  stochastic  systems.  Subopcimal  schemes  then  may  have  to 
be  modified  and  extended  to  solve  such  problems  and  further  improve 
performances.  Finally,  the  control  aspects  of  the  estimation  problem 
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have  Co  be  examined  Co  greaC  extent  since  escimadon  and  concrol  dually 
arise  naCurally  and  lnceracc  in  a  coupling  fashion  thac  Che  cvo  problems 
have  co  be  invescigaced  cogecher. 

Another  aspecc  of  Che  escimadon  problem  chac  is  imporcanc  and 
pracclcal  in  naCure  is  when  Che  stochasdc  syscem  is  excited  by  Wiener 
and  Poisson  white  processes  nonlinearly.  Analysis  of  chese  systems  have 
CO  be  carried  chrough  co  greaC  depth.  Approximation  scheme  as  subopdmal 
strategy  Co  Che  problem  may  have  to  be  developed  both  analytically  and 
numerically. 


T 
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APPEND LX  A:  Doleans-Dade  and  Meyer  Differential  Rule  for  Discontinuous 
Semi-Martingales  (2,19  et  al.J 

k  N 

Let  X  be  an  n-dimensional  vector-valued,  discontinuous  semi- 

l  .  * 

martingale,  and  let  2  be  a  twice  differentiable  function  defined  on  Rn. 
Then  the  process  ■  ®(Xt>  is  also  a  semi -mart ingale ,  such  that 

dQt  "  °i(Xt->dXt  +  2  trI*«at->d  <  xC'xC>tJ 

+  «*{  Z  (©(X  )  -  0(X  )  -  ® '  (X  )  CX  -X  )J} 

t0<s<t  x  s  s  s- 

Here  ®  :1a  Che  gradient  of  the  function  ®,  ®  its  Jacobian,  while  XC  is 

•*  XX 

the  continuous  part  of  X,  and  the  summation  is  carried  out  over  those 
values  of  s  where  X  Jumps.  <  N,N  >  in  the  notation  of  Wong  (20)  is  the 
increasing  process  associated  with  the  continuous  martingale  or  semi- 
martingale  N.  Note  that  this  expression  reduces  to  the  ordinary  Ito 
differencial  rule  when  Che  process  is  continuous. 
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APPENDIX  B:  Fundamental  Filtering  Theorem  for  White  Gaussian  Observation 
Noise  f 2 - 7 J 


Let  ,  t  >  tg  be  the  vector -valued  semi-martingale  defined  by 

dQ^  -  R£dt  ♦  dl^  C  -  C0 

where  M  is  a  vector-valued  martingale  with  respect  to  a  growing  family  of 
7-fields  t  >  tQ  and  R  is  a  process  that  is  adapted  to  7.  Let  further¬ 
more  the  vector  valued  process  Y  ,  t  >  tQ  be  the  semi -mart ingale  defined 
by 

dYc  -  H£dt  +  d«c  t  >  tQ 


where  H  is  another  vector-valued  process  that  is  adapted  to  7,  and  W  is 
vector- valued  Brownian  motion  and  a  martingale  with  respect  to  7,  such  that 
E (dW^  dW^)  »  V(t)dt.  The  prime  denotes  the  transpose,  and  V(t)  >  0,  t  >  t^. 
Define  5^,  t  >  t^,  as  the  growing  family  of  c-fields  generated  by  Y,  and 
assume  the  notation 

K  *  E^ci,t) 

Then,  the  process  ,  t  *  t^  satisfies  the  following  equation, 


Here 


d^  -  Rcdt  ♦  (Q^  -  ♦  £tl  V'l(t) 


[dYt  -  Hcdt), 


C  -C0 


&  d 


~  C  MC ,  W  >t 


c 

M  is  the  continuous  part  of  the  martingale  M.  If  Mj  and  H,  are  two  vector 
valued  continuous  martingales  or  semi-martingales ,  <  >  is  a  matrix 
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stochastic  process,  the  (l,J)-th  element  of  which  is  given  by  <  > 

which  is  given  as  follows 


<  Mu.  Hjj  >  4  1  t<  Mu*Mjr  Mjj  +  Hjj  >-<  HU.MU  >  -  <  >) 


in  the  notation  of  Wong. 
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APPENDIX  C:  Numerical  Algorithm  Map  A  -  M  *D  (13,14  ec  al.] 


For  Che  algorlchma  in  che  form  of  an  algorithmic  map  A  aa 
A  -  M  *D 


roughly,  after  che  map  D  determines  a  direction,  che  map  M  maximizes  the 
objective  function  J  in  that  direction.  We  choose  M  to  be  che  Golden- 
Section  Search  D,  che  conjugate-direction  method.  The  algorithmic  con¬ 
vergence  of  A  depends  on  the  closedness  of  the  point-to-set  map  A  which 
follows  from  the  closedness  of  M  and  D. 

(1)  Golden-Section  Search  iM) 

The  procedures  start  with  a  feasible  interval  I  -  containing 

the  optimal  point  - .  Then,  given  an  interval 


Jr  lr  |r 

i  -  [a;.  bJj 

containing  t,  they  determine  I*'**' [Bq4’*,  also  containing  r,  such  that 

k+l  k 

I  Cl  .  The  Golden-Seccion  search  is  for  J  concave.  It  requires  use 
of  the  Fibonacci  fractions 

I  -AjZL 


F1  “ 


0.38 


and 


F2  •  -  1  -  m  0-62 


Observe  Chat  F^  -  (Fj)  .  Given  an  Interval  1  •  (Sq.S^  of  length 
the  next  Interval  is  selected  as  follows.  Let 


and 


5i  ■  s0  *  Ft«w 


be  points  on  the  interval 
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If  J(®q)  >  J0[).  the  new  interval  is  (3q»@|J 

if  J(0q)  c  the  new  interval  is  [0^,9^ 

If  J(9q)  *  J(fl[).  the  new  interval  is  either  (0q.9[1  or  [9^,0^. 

We  are  assured  by  the  concavity  or  (local)  concavity  that  the  optimal  point 
Mn  on  the  resulting  interval.  It  should  be  clear  chat  the  algorithm  is 
closed.  Before  we  state  the  conjugate-direction  method  (D),  let  us  introduce 
a  procedure  of  simple  maximization,  a  spacer  step. 

(2 )  Spacer  Seeps 

U 

In  optimizing  J(r),  r €  E  ,  it  is  important  to  have  a  step  such  as  that 

k  k+i 

used  above  which,  given  a  nonoptlmal  point  ,  generates  a  point  ▼  for 
which 

k^.1  k 

J(I  )  >  J(L  ) 


We  call  this  step  a  space  step.  The  spacer  step  must  also  possess  one  ocher 
k 

property,  should  3.  he  a  solution,  than  the  spacer  step  much  indicate  this 

fact.  Thus  the  spacer  step  must  determine  whether  *  is  optimal,  and  if 
k 

'  is  not  optimal,  it  must  calculate  a  better  point.  For  our  purpose  we 
choose  the  eye  lie -coordinate-ascent  method,  as  it  avoids  calculation  of 
derivatives,  and  is  quite  simple  to  implement. 

The  eye lic-coordinatc-ascent  method  optimizes  cyclically  in  each  of 
the  coordinate  directions.  For  example  in  e\  the  coordinate  directions 
are 


f 
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(3)  Conjugate-Direct  ion  Method  (D) 

N 

EUsically,  Che  conjugate-direct  Iona  of  a  quadratic  function  p  in  E 
ara  a  set  of  N  directions  in  E  such  that  optimization  in  each  of  chese 
directions  once  only  will  either  find  the  maximum  or  reveal  p  to  be 
unbounded  above.  Also  the  order  of  Che  directions  optimized  is  inmate  rial. 

Since  our  objective  function  J  is  not  quadratic  in  general,  we  shall 
nevertheless  using  this  method  for  optimization  of  J  exploiting  a  quadratic 
approximation  to  J. 

N 

(i)  Generating  conjugate  directions  for  quadratic  p€E  : 

Let  d^,..*d  ,  r  <  N  be  conjugate.  Suppose  •'  and  t"  maximize 
p  in  two  different  manifolds  containing  d^,...d  and  p(r')  <  pH"'). 


W^r^ra-l 


where  d  -  t»  -  r* 
r+1 

are  conjugate.  Thus,  given  r  conjugate  directions  r  <  N,  another 
conjugate  direction  is  generated. 

(tl)  Conjugate-directions  for  J€e"  ;  we  shall  develop  an  algorithm 
which  attempts  to  calculate  conjugate  directions  for  J.  The 
procedure  can  be  thought  of  as  calculating  conjugate  directions 
to  a  quadratic  that  approximates  J.  The  quadratic  approximation 
notion  is  thus  being  utilized. 

M  directions  will  be  always  used.  Initially  chese  start  out 
being  N  arbitrary  directions  d|,d*  ...  .  At  the  end  of 

iteration  one,  the  direction  d‘  is  deleted,  the  directions 
d|,  1*1,2  ...  N-l  are  determined  by  d|  ^  *  d|  for  i*2,...N, 
and  by  making  d^  a  new  direction  conjugate  to  d^_ ^ .  The  procedure 
continues  in  the  manner. 
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Initial  Step  An  Initial  point  and  N  direction*  d‘,  1*1,2...  N 
are  given  with  d*  #  0.  Fir  i  •  1 , 2  ...  N  calculate 

0  a.  0  1 

i.e.  by  Che  Colden-section  search. 

0  0 

Define  *nd  set  k  -  l. 

k-1  k 

Iteration  k  •  TN> ,  and  d^ ,  i  •  1 , . . .  N  are  given 

k-1  k 

(a)  Spacer  step  on  yielding 

(b)  For  i  -  1,2.  ...  N  calculate  d*) 

(c)  Set 

.k  k  ,k-l 

S+i  "  -N+l  ‘  N+2 

and  calculate 
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APPENDIX  D:  Conditional  Gauesiar.  P-;?cessea 


y 


Let  an  unobservable  process  x  •  tx^.5^)  •n<1  40  observable  process 
(y^,?^)  be  processes  of  the  diffusion  type  with 


d*t  -  ItfjU.y)  ♦  <72(t,y)y)dt  ♦  ^  (t  ,y)dul  (t ) 

♦  *2<t,y)«iv2(c)  (D.l) 

<lyt  -  (C^ft.y)  C2it.y)xtJdt  *  & ,  (t,y)dWl(t) 

♦  i2(t,y)dU2(t)  (D.2) 


where  same  assumptions  and  definitions  of  the  processes  involved  in  the 

equations  are  made  as  Eq*.  (5.2)  to  (5.10). 

We  have  the  following  results  (7, 8, 21*25) 

(l)  If  the  conditional  distribution  ?  (a)  •  P(x  <  a, y.)dy  is  Gaussian. 

Yq  u  u  s 

Chen  the  random  process  x  «  (x£,?t)  1»  conditional  Gaussian  almost 
surely.  That  is,  for  any  t  and  0  <  tQ  <  t^  <  ---  t^  <  t,  the 
conditional  distributions 


r«*'  0 


**  9n>  *  *<*«■  1  9f 

n  t  i 


xf  <  •  | yj) 

w  "  n  i 


(D.3) 


(2) 


is  Gaussian. 

Let  y  •  b#  another  process  satisfies 

dyt  •«l(t.y)dWl(t)  ♦  *2(t,y)dW,<t) 

and  let  and  be  two  measures  corresponding  to  the  processes 
and  y  defined  by  Eqs.  (D.2)  and  (D.4). 

Then  is  equivalent  to  a.  l.e. 

u  —  tv 

y  y 

and  the  density 


(0.4) 

y 


(0.5) 


>t(y>  - 


dw 


(t.y) 


(0.6) 
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Is  given  by 


»t(y>  -  «*Pi  -  J  l  jy 
0 


t  ^(s.y)  ♦  ^(S.y)  m#(y) 


)  dy. 


6l{».y)  +  i2(s,y) 


,  l^,(«.y)  +tf,(*.y)  «  (y)l 

♦  r  - ±5 - % - S -  da] 

o  ®j(«.y)  ♦  *2(«.y) 


(D.7) 


where  *4<y)  -  t{  xjf*}  (D.8) 

(3)  The  estimate  of  x  end  let  variance  denoted  by  s>c  and  satisfy 

<*»c  *  (^(t.y)  ♦  tf2(t.y)®t J  «*t 
s  **(t,y)  ♦  PC. (t,y) 

♦  - PI(t','y) — 1 -  ^dyt  *  -^2(t,y)“t J  dt^  (D‘9) 


Pt  •  2£72(t,y)?t  ♦-?  *5(t,y) 


(5  **(t.y)  +Pt^2(t,y))2 
A  *fi(t.y) 


(D. 10) 


subject  to  the  conditions 

*0  "  E^xolyO)  (D*  11) 

P0  ’  ElVo*2^1  »•«) 


(4)  And  the  smoothed  estimate 

Vt  *  E(\l,t> 

and  its  variance  denoted  by  satisfy 


dc*e/T  *  ♦  ^2(t.y)mc/T}dc  ♦  5(c,y)p;l[«t-  »t/T 

+  5*6(t.y)(li  *«)'l(t,y)(dyt  -  [^(c.y)  ^(c.y)* 

Pc/T  *  .y>  +5(C.y)P*1}  Pc/T  -3(C.y) 

where 

S**(C,y)  •Sl(  t.y)&l(t,y)  +  5,  (t  ,y  (t  ,y) 

and 

£<t.y>  -  tfjU.y)  -  <?**)<c.y)(*  **)'l(t.yXT2(c,y) 

5(t,y)  *  5  *5(t  ,y)  -  (3  *4  )2  (t  ,y)  (£■  ** )" 1  (t  ,y ) 
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